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(54) Title: NOVEL COFACTORS OF THE PREGNANE X RECEPTOR AND METHODS OF USE 

(57) Abstract: The present invention relates to novel cofactors of the Pregnane X Receptor which we call CF1, CF2, CF3, CF4 
and CF 44 the isolated nucleic acid sequences thereof and the isolated proteins thereof. The invention further relates to processes 
for isolating and/or producing the nucleic acids or the proteins as well as methods of use of these cofactors, such as inhibiting or 
activating the binding of the cofactors to PXR. 
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NOVEL COFACTORS OF THE PREGNANE X RECEPTOR AND METHODS OF 
USE 



BACKGROUND OF THE INVENTION 

Multicellular organisms are dependent on advanced mechanisms of information , 
transfer between cells and body compartments. The information that is transmitted 
can be highly complex and can result in the alteration of genetic programs involved in 
cellular differentiation, proliferation, or reproduction. The signals, or hormones, are 
often simple molecules, such as peptides, fatty acid, or cholesterol derivatives. 

Many of these signals produce their effects by ultimately changing the transcription of 
specific genes. One well-studied group of proteins that mediate a cell's response to a 
variety of signals is the family of transcription factors known as nuclear receptors, 
hereinafter referred to frequently as "NR". Members of this group include receptors 
for steroid hormones, vitamin D, ecdysone, cis and trans retinoic acid, thyroid 
hormone, bile acids, cholesterol-derivatives, fatty acids (and other peroxisomal 
proliferated), as well as so-called orphan receptors, proteins that are structurally 
similar to other members of this group, but for which no ligands are known (Escriva, 
H. et al., Ligand binding was acquired during evolution of nuclear receptors, PNAS, 
94, 6803 - 6808, 1997). Orphan receptors may be indicative of unknown signaling 
pathways In the cell or may be nuclear receptors that function without ligand 
activation. The activation of transcription by some of these orphan receptors may 
occur in the absence of an exogenous ligand and/or through signal transduction 
pathways originating from the cell surface (Mangelsdorf, D. J. et al., The nuclear 
receptor superfamily: the second decade, Cell 83, 835-839, 1995). 

In general, three functional domains have been defined in NRs. An amino terminal 
domain is believed to have some regulatory function. A DNA-binding domain 
hereinafter referred to as "DBD" usually comprises two zinc finger elements and 
recognizes a specific Hormone Responsive Element hereinafter referred to as "HRE" 
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within the promoters of responsive genes. Specific amino acid residues in the "DBD B 
have been shown to confer DN A sequence binding specificity (Schena, M. & , 
Yamamoto, K.R., Mammalian Glucocorticoid Receptor Derivatives Enhance 
Transcription in Yeast, Science, 241:965-967, 1988). A Ligand-binding-domain 
hereinafter referred to as "LBD" is at the carboxy-terminal region of known NRs. In 
the absence of hormone, the LBD appears to interfere with the interaction of the DBD 
with its HRE. Hormone binding seems to result in a conformational change in the NR 
and thus opens this interference (Brzozowski et al., Molecular basis of agonism and 
antagonism in the oestogen receptor, Nature, 389, 753 - 758, 1997; Wagner et al., A 
structural role for hormone in the thyroid hormone receptor, Nature, 378, 690 - 697. 
1995). A NR without the HBD constitutively activates transcription but at a low level. 

Both the amino-terminal domain and the LBD of the NR appear to havetranscription 
activation functions hereinafter referred to as "TAP. Acidic residues in the amino- 
terminal domains of some nuclear receptors may be important for these transcription 
factors to interact with RNA polymerase. TAF activity may be dependent on 
interactions with other protein factors or nuclear components (Diamond et al., 
Transcription Factor Interactions: Selectors of Positive or Negative Regulation from a 
Single DNA Element, Science, 249:1266-1272 , 1990). Certain oncoproteins (e.g., c- 
Jun and c-Fos) can show synergistic or antagonistic activity with glucocorticoid 
receptors (GR) In transfected cells. Furthermore, the receptors for estrogen and 
vitamins A and D, and fatty acids have been shown to interact, either physically or 
functionally, with the Jun and Fos components of AP-1 in the transactivation of 
steroid- or AP-1 regulated genes. 

Coactivators or transcriptional activators are proposed to bridge between sequence 
specific transcription factors, the basal transcription machinery and in addition to 
influence the chromatin structure of a target cell. Several proteins like SRC-1 , AGTR, 
and Gripl , which are also cofactors of NRs similar to those disclosed in this 
invention, interact with NRs in a ligand enhanced manner (Heery et al., A signature 
motif in transcriptional coactivators mediates binding to nuclear receptors, Nature, 
387, 733 - 736; Heinzel et al., A complex containing N-CoR, mSin3 and histone 
deacetylase mediates transcriptional repression, Nature 387, 43-47, 1997). 
Furthermore, the physical interaction with negative receptor-interacting proteins or 
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corepressors has been demonstrated (Xu et al., Coactivator and Corepressor 
complexes in nuclear receptor function, Curr Opin Genet Dev, 9 (2), 140 - 147, 
1999). 

Nuclear receptor modulators like steroid hormones affect the growth and function of 
specific cells by binding to intracellular receptors and forming nuclear receptor-ligand 
complexes. Nuclear receptor-hormone complexes then interact with a hormone 
response element (HRE) in the control region of specific genes and alter specific 
gene expression. 

Over the past decade, new members of the nuclear hormone gene family have been 
identified that lack known ligands. These orphan receptors can be used to uncover 
signaling molecules that regulate yet unidentified physiological networks. Some of 
, these orphan receptors are constitutively active and transactivate target genes 
without the need to interact with a ligand (Mangelsdorf et al, 1 995). 

The present invention relates to the identification of novel cofactors of the pregnane 
X receptor (hereinafter referred to as PXR). Note that the human PXR is sometimes 
also referred to as SXR, for simplicity, we will solely use the name PXR, also for the 
human protein or gene. PXR is a recently identified orphan nuclear receptor that 
combines features of nuclear receptors of both the steroid and nonsteroid subfamily: 
like nonsteroid receptors, PXR binds as a heterodimer with RXR to the HREs of the 
respective hormone responsive genes. However, interestingly PXR is effectively 
activated by several steroids, including the naturally occuring pregnanes as well as 
synthetic glucocorticoids and antiglucocorticoids (Kliever et al., Cell 92, 73 (1998)). 
PXR is abundantly expressed in only a small subset of tissues, predominantly in liver 
and intestine. Evidence has been provided that PXR acts as a key transcriptional 
regulator of the genes for cytochrome P450 (CYP) monooxygenases of the 3A 
subfamily (Moore et al., PNAS 97, 7500 (2000); Kliever et al., Cell 92, 73 (1998). 
These hepatic monooxygenases metabolise steroid hormones, including 
corticosteroids, progestins and androgens as well as a whole range of drugs and 
xenobiotics. PXR binds to the CYP3 promoter and the transactivating function of 
PXR is activated by a range of xenobiotics known to induce CYP3 expression, 
including rifampicin, RU486, phenoarbital, and pregnenolone. Notably, it has been 
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shown recently that PXR is activated very efficiently by hyperforin, a constituent of 
the herbal remedy St. John's wort, which is widely used for the treatment of 
depression (Moore et aL PNAS 97, 7500 (2000)). 

At present it appears that PXR is involved in a novel steroid hormone signaling 
pathway with implications in the regulation of steroid hormone and sterol 
homeostasis. Therefore the identification of PXR cofactor proteins that might mediate 
PXR transactivation activity could provide means for the treatment of numerous 
diseases or pathophysiological symptoms as a consequence of endocrine 
malfunction. Furthermore, the examination of PXR variants as well as variants of the 
PXR interacting proteins could provide valuable clues with respect to the ability of a 
person to metabolise a certain drug. The kowledge of the genetic background of a 
person could also help to predict drug-drug interactions. 

The present invention provides novel proteins, nucleic acids, and methods useful for 
developing and identifying compounds for the treatment of such diseases and 
disorders as metabolic disorders, immunological indications, hormonal dysfunctions 
and/or neurosystemic diseases and others not specifically mentioned here. 

In preferred embodiments of the invention methods are disclosed for testing whether 
certain compounds promote the interaction of the newly disclosed PXR cofactor 
proteins with PXR, allowing conclusions on the effects of the compound on PXR 
activity, and thus on the induction of proteins important for the degradation of 
xenobiotics, such as the CYP3 proteins. 

These novel proteins interact In vivo with the pregnane x receptor and shall 
hereinafter collectively be referred to as "cofactors". 

Identified and disclosed herein are protein sequences for novel cofactors and the 
nucleic acid sequences encoding these cofactors, which we call: CF1, CF2, CF3, 
CF4 and CF 44, or simply collectively „CFs\ 

The importance of this invention is manifested in the effects of the CFs to modulate 
genes involved in cellular functions like regulation of metabolism and cell 
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homeostasis, cell proliferation and differentiation, pathological cellular aberrations, or 
cellular defense mechanisms. 

Thus, the CF proteins are useful for screening for pregnane x receptor, thereby 
providing for agents which may influence the activety of PXR as well as in further 
preferred embodiments of the invention RXR and thus thereby transcriptionally 
induced P450 CYP mono oxygenases of the 3A subfamily. 

In one aspect of the present invention, we provide isolated nucleic acid sequences 
for novel CFs. In particular, we provide the cDNA sequences encoding the human 
CFs. 

These nucleic acid sequences have a variety of uses. For example, they are useful 
for making vectors and for transforming cells, both of which are ultimately useful for 
production of the CF proteins. 

They are also useful as scientific research tools for developing nucleic acid probes 
for determining expression levels of the cofactor genes, e.g., to identify diseased or 
otherwise abnormal states. They are useful for developing analytical tools such as 
anti sense oligonucleotides for selectively inhibiting expression of the cofactor genes 
to'determine physiological responses. 

In another aspect of the present invention, we provide a homogenous composition 
comprising the cofactor proteins. The protein is useful for screening drugs for agonist 
and antagonist activity, and, therefore, for screening for drugs useful in regulating 
physiological responses associated with the cofactors according to the Invention. 
Specifically, antagonists to the CFs could be used to treat metabolic disorders, 
immunological indications, hormonal dysfunctions, neurosystemic diseases. The 
proteins are also useful for developing antibodies for detection of the proteins. 

Flowing from the foregoing are a number of other aspects of the invention, including 
(a) vectors, such as plasmids, comprising the cofactor nucleic acid sequences that 
may further comprise additional regulatory elements, e.g., promoters, (b) transformed 
cells that express the cofactors, (c) nucleic acid probes, (d) antisense 
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oligonucleotides, (e) agonists, (f) antagonists, and (g) transgenic mammals. Further 
aspects of the invention comprise methods for making and using the foregoing 
compounds and compositions. 

The foregoing merely summarizes certain aspects of the present invention and is not 
intended, nor should it be construed, to limit the invention in any manner. All patents 
and other publications recited herein are hereby incorporated by reference in their 
entirety. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

THE CF1, CF2, CF3, CF4 AND CF 44 PROTEINS AND NUCLEIC ACIDS: 

The present invention comprises, in part, novel CF cofactors of PXR. Particularly 
preferred embodiments of these cofactors are those having an amino acid sequence 
substantially the same as SEQ ID NO. 3 for CF1, SEQ ID NO. 6 forCF2, SEQ ID 
NO. 9 for CF3. SEQ ID NO. 12 for CF 4 and SEQ ID NO. 28 for CF 44 (see also 
Figures). 

As used herein, if reference to the cofactor is made or the cofactor n X",wherein "X" 
stands for the number designating the cofactor, it is meant as a reference to any 
protein having an amino acid sequence substantially the same as SEQ ID NO. 3 for 
CF1 , SEQ ID NO. 6 for CF2, SEQ ID NO. 9 for CF3, SEQ ID NO. 12 for CF 4 and 
SEQ ID NO. 30 for CF 44 (see also Figures). 

The present invention also comprises the nucleic acid sequences encoding the 
cofactors 1 to 4 which nucleic acid sequences are substantially the same as SEQ ID 
NO. 1 for CF1 , SEQ ID NO. 4 for CF2, SEQ ID NO. 7 for CF3, SEQ ID NO. 10 for CF 
4 and SEQ ID NO. 28 for CF 44 (see also Figures) all encoding human cofactors as 
preferred embodiments and/or the complements thereof as shown in SEQ ID NO. 2 
for CF1 , SEQ ID NO. 5 for CF2, SEQ ID NO. 8 for CF3, SEQ ID NO. 1 1 for CF 4 and 
SEQ ID NO. 29 for CF 44 (see also Figures). 
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Herein the "complement" refers to the complementary strand of the nucleic acid 
according to the invention, thus the strand that would hybridize to the nucleic acid 
according to the invention. In accordance with standard biological terminology all 
DNA sequences herein are however written in 5-3' orientation, thus the 
complements depicted (see also figures) are actually "reverse" complements (as also 
stated in the figures). For simplification purposes they are however some times 
referred to simply as "complements". One skilled in the art, given the DNA 
sequence(s) would be able to create the correct reverse complement. 

As used herein, a protein "having an amino acid sequence substantially the same as 
SEQ ID NO x" (where "x" is the number of one of the protein sequences recited in the 
Sequence Listing) means a protein whose amino acid sequence is the same as SEQ 
ID NO x or differs only in a way such that at least 50% of the residues compared in a 
sequence alignment with SEQ ID NO. x are identical, preferably 75% of the residues 
are identical, even more preferably 95% of the residues are identical and most 
preferably at least 98% of the residues are identical 

Those skilled in the art will appreciate that conservative substitutions of amino acids 
can be made without significantly diminishing the protein's affinity for interacting 
proteins, DNA binding sites, cofactor modulators, e.g. small molecular hydrophobic 
compounds, or RNA. 

Other substitutions may be made that increase the proteins' affinity for these 
compounds. Making and identifying such proteins is a routine matter given the 
teachings herein, and can be accomplished, for example, by altering the nucleic acid 
sequence encoding the protein (as disclosed herein), inserting it into a vector, 
transforming a cell, expressing the nucleic acid sequence, and measuring the binding 
affinity of the resulting protein, all as taught herein. 

As used herein the term "a molecule having a nucleotide sequence substantially the 
same as SEQ ID NO y" means a nucleic acid encoding a protein "having an amino 
acid sequence substantially the same as SEQ ID NO y" as defined above. This 
definition is intended to encompass natural allelic variations in the CF sequences. 
Cloned nucleic acid provided by the present invention may encode CF proteins of 
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any species of origin, including (but not limited to), for example, mouse, rat, rabbit, 
hamster, cat, dog, pig, primate, and human. 

Preferably the nucleic acids provided by the invention encode CFs of mammalian, 
preferably mouse and most preferably human origin. 

IDENTIFICATION OF VARIANTS AND HOMOLOGIES AS WELL AS USE OF 
PROBES: 

Nucleic acid hybridization probes provided by the invention are nucleic acids 
consisting essentially of the nucleotide sequences complementary to any sequence 
depicted in SEQ ID NO. 1 , SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 10 and SEQ 
ID NO. 28 and/or the complemets thereof as shown in SEQ ID NO. 2, SEQ ID NO. 5, 
SEQ ID NO. 8, SEQ ID NO. 11 and SEQ ID NO. 29, or parts thereof which are 
effective in nucleic acid hybridization 

Nucleic acid hybridization probes provided by the invention are nucleic acids capable 
of detecting i.e. hybridizing to the gene encoding the polypeptides according to SEQ 
ID NO. s: 3, 6, 9, 12 and 30. 

Nucleic acid probes are useful for detecting CF gene expression in cells and tissues 
using techniques well-known in the art, including, but not limited to, Northern blot 
hybridization, in situ hybridization/and Southern hybridization to reverse 
transcriptase - polymerase chain reaction product DNAs. The probes provided by the 
present invention, including oligonucleotide probes derived therefrom, are also useful 
for Southern hybridization of mammalian, preferably human, genomic DNA for 
screening for restriction fragment length polymorphism (RFLP) associated with 
certain genetic disorders. As used herein, the term complementary means a nucleic 
acid having a sequence that is sufficiently complementary in the Watson-Crick sense 
to a target nucleic acid to bind to the target under physiological conditions or 
experimental conditions those skilled in the art routinely use when employing probes. 
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It is understood in the art that a nucleic acid sequence will hybridize with a 
complementary nucleic acid sequence under high stringent conditions as defined 
herein, even though some mismatches may be present Such closely matched, but 
not perfectly complementary sequences arealso encompassed by the present 
invention. For example, differences may occur through genetic code degeneracy, or 
by naturally occurring or man made mutations and such mismatched sequences 
would still be encompassed by the present claimed invention. 

Preferably, the nucleotide sequences of the nuclear cofactors SEQ ID NOs:1, 4, 7, 
10 and 28 and/or their complements SEQ ID NO.s 2, 5, 8, 1 1 and 29 can be used to 
derive oligonucleotide fragments (probes) of various length. Stretches of 17 to 30 
nucleotides are used frequently but depending on the screening parameters longer 
sequences as 40, 50, 100, 150 up to the full length of the sequence may be used. 
Those probes can be synthesized chemically and are obtained readily from 
commercial oligonucleotide providers. Chemical synthesis has improved over the 
years and chemical synthesis of oligonucleotides as long as 100-200 bases is 
possible. The field might advance further to allow chemical synthesis of even longer 
fragments. Alternatively, probes can also be obtained by biochemical de novo 
synthesis of single stranded DNA. In this case the nucleotide sequence of the 
nuclear receptors or their complements serve as a template and the corresponding 
complementary strand is synthesized. A variety of standard techniques such as nick 
translation or primer extension from specific primers or short random oligonucleotides 
can be used to synthesize the probe (Sambrook, J., Fritsch, E.F. & Maniatis , T. 
Molecular cloning: a laboratory manual. Cold Spring Harbor Press, Cold Spring 
Harbor, 1 989)). Nucleic acid reproduction technologies exemplified by the 
polymerase chain reaction (Saiki, R.K. eta/. Primer-directed enzymatic amplification 
of DNA with a thermostable DNA polymerase. Science 239, 487-491 (1988)) are 
commonly applied to synthesize probes, in the case of techniques using specific 
primers the nucleic acid sequences of the nuclear receptors or their complements are 
not only used as a template in the biochemical reaction but also to derive the specific 
primers which are needed to prime the reaction. 
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In some cases one might also consider to use the nucleic acid sequence of the 
cofactors or their complements as a template to synthesize an RNA probe. A 
promoter sequence for a DNA-dependent RNA polymerase has to be introduced at 
the 5'-end of sequence. As an example this can be done by cloning the sequence in 
a vector which carries the respective promoter sequence. It is also possible to 
introduce the needed sequence by synthesizing a primer with the needed promoter in 
the form of a 5' "tail". The chemical synthesis of a RNA probe is another option. 

vAppropriate.means are available to detect the event of a hybridization. There is a 
wide variety of labels and detection systems, e.g. radioactive isotopes, fluorescent, or 
chemiluminescent molecules which can be linked to the probe. Furthermore, there 
are methods of introducing haptens which can be detected by antibodies or other 
ligands such as the avidin/biotin high affinity binding system. 

Hybridization can take place in. solution or on solid phase or in combinations of the 
two, e.g. hybridization in solution and subsequent capture of the hybridization product 
onto a solid phase by immobilized antibodies or by ligand coated magnetic beads. 

Hybridization probes act by forming selectively duplex molecules with complementary 
stretches of a sequence of a gene or a cDNA. The selectivity of the process can be 
controlled by varying the conditions of hybridization. To select sequences which are 
identical highly homologous to the sequence of interest stringent conditions for the 
hybridization will be used, e.g. low salt in the range of 0.02 M to 0.15 M salt and/or 
high temperatures in the range from 50°C degrees centigrade to 70°C degrees 
centigrade. Stringency can be further improved by the addition of formamide to the 
hybridisation solution. The use of stringent conditions which means that only little 
mismatch or a complete match will lead to a hybridization product would be used to 
isolate closely related members of the same gene family. Thus, as used herein 
stringent hybridization conditions are those where between 0.02 M to 0.15 M salt 
and/or high temperatures in the range from 50°C degrees centigrade to 70°C 
degrees centigrade are applied. 

The use of highly stringent conditions or conditions of "high stringency" means that 
only very little mismatch or a complete match which lead to a hybridization product 
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would be used to isolate very closely related members of the same gene family.- - 
Thus, as used herein highly stringent hybridization conditions are those where 
between 0.02 - 0.3 M salt and 65°C degrees centigrade are applied for about 5 to 18 
hours of hybridization time and additionally, the sample filters are washed twice for 
about 15 minutes each at between 60°C - 65°C degrees centigrade, wherein the first 
washing fluid contains about 0.1 M salt (NaCI and/or Sodium Citrate) and the second 
contains only about 0.02 M salt (NaCI and/or Sodium Citrate). In a preferred 
embodiment the following conditions are considered to be highly stringent: 

Hybridisation in a buffer containing 2 x SSC (0.03 M Sodium Citrate, 0.3 M NaCI) at 
65°C - 68°C degrees centigrade for 12 hours, followed by a washing step for 15 
minutes in 0.5 x SSC, 0.1% SDS, and a washing step for 15 minutes at 65°C degrees 
centigrade in 0.1 x SSC, 0.1% SDS. 

Less stringent hybridization conditions, e.g. 0.15 M salt - 1 M salt and/or 
temperatures from 22°C degrees centigrade to 56°C degrees centigrade are applied 
in order to detect functionally equivalent genes in the same species or for 
orthologous sequences from other species. 

Unspecific hybridization products are removed by washing the reaction products 
repeatedly in 2 x SSC solution and increasing the temperature. 

DEGENERATE PGR AND CLONING OF HOMOLOGUES 

The nucleotide sequences of the cofactors CF1 to CF4 or their complements can be 
used to design primers for a polymerase chain reaction. Due to the degeneracy of 
the genetic code the respective amino acid sequence Is used to design 
oligonucleotides in which varying bases coding for the same amino acid are included. 
Numerous design rules for degenerate primers have been published (Compton et at, 
1990). As In hybridization there are a number of factors known to vary the stringency 
of the PCR. The most important parameter is the annealing temperature. To allow 
annealing of primers with imperfect matches annealing temperatures are often much 
lower than the standard annealing temperature of 55°C, e.g. 35°C to 52°C degrees 
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can be chosen. PCR reaction products can be cloned. Either the PCR product is : 
cloned directly, with reagents and protocols from commercial manufacturers (e.g. 
from Invitrogen, San Diego, USA). Alternatively, restriction sites can be introduced 
intro the PCR product via a S'-tail of the PCR primers and used for cloning. 

GENETIC VARIANTS 

Fragments from the nucleotide sequence of the cofactors ortheir complements can 
be used to cover the whole sequence with overlapping sets of PCR primers. These 
primers are used to produce PCR products using genomic DNA from a human 
diversity panel of healthy individuals or genomic DNA from individuals which are 
phenotypically conspicuous. The PCR products can be screened for polymorphisms, 
for example by denaturing gradient gel electrophoresis. binding to proteins detecting 
mismatches or cleaving heteroduplices or by denaturing- high-performance liquid 
chromatography. Products which display mutations need to be sequenced to identify 
the nature of the mutation. Alternatively, PCR products can be sequenced directly 
omitting the mutation screening step to identify genetic polymorphisms. If genetic 
variants are identified and are associated with a discrete phenotype, these genetic 
variations can be included in diagnostic assays. The normal variation of the human 
population is of interest in designing screening assays as some variants might 
interact better or worse with a respective lead, i.e. therapeutic or potentially 
therapeutic substance (a pharmacodynamic application). Polymorphisms or 
mutations which can be correlated to phenotypic outcome are a tool to extend the 
knowledge and the commercial applicability of the nucleotide sequences of the 
claimed cofactors or their complements or their gene products, as variants might 
have a slightly different molecular behavior or desired properties. Disease-causing 
mutations or polymorphisms allow the replacement of this disease inducing gene 
copy with a wild-type copy by means of gene therapy approaches and/or the 
modulation of the activity of the gene product by drugs. Disease-causing mutations or 
polymorphisms in the CF cofactors allow predictions on the induction of the CYP3 
genes by certain substances, and thus on the degradation of drugs in the body. 

PREPARATION OF POLYNUCLEOTIDES: 
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DNA which encodes the cofactor may be obtained, in view of the instant disclosure, 
by chemical synthesis, by screening reverse transcripts of mRNA from appropriate 
cells or cell line cultures, by screening genomic libraries from appropriate cells, or by 
combinations of these procedures, as illustrated below. 

Screening of mRNA or genomic DNA may be carried out with oligonucleotide probes 
generated from the CF nucleotide sequences information provided herein. 

Probesmay be labeled with a detectable group such as a fluorescent group, a 
radioactive atom or a chemiluminescent group in accordance with known procedures 
and used in conventional hybridization assays, as described in greater detail in the 
Examples below. Alternatively, the CF nucleotide sequences may be obtained by use 
of the polymerase chain reaction (PCR) procedure, with thePCR oligonucleotide 
. primers being produced from the CF nucleotide sequences provided herein, 
according to SEQ ID NO 1 , SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 10 and SEQ 
ID NO. 28 and/or the complements thereof as shown in SEQ ID NO. 2, SEQ ID NO. 
5, SEQ ID NO. 8, SEQ ID NO. 11 and SEQ ID NO. 29, or parts thereof. 

Upon purification or synthesis, the nucleic acid according to the invention may be 
labeled, e.g. for use as a probe. 

As single and differential labeling agents and methods, any agents and methods 
which are known in the art can be used. For example, single and differential labels 
may consist of the group comprising enzymes such as p-galactosidase, alkaline 
phosphatase and peroxidase, enzyme substrates, coenzymes, dyes, chromophores, 
fluorescent, chemiluminescent and bioluminescent labels such as FITC, Cy5, Cy5.5, 
Cy7, Texas-Red and IRD40(Chen et at. (1993), J. Chromatog. A 652: 355-360 and 
Kambara et al. (1992), Electrophoresis 13: 542-546), ligands or haptens such as 
biotin, and radioactive isotopes such as 3 H, 35 S, 32 P 125 l and 14 C. 

EXPRESSION OF THE CF PROTEI N/POL YPETI DES: 

The CF nucleic acids or polypeptides may be synthesized in host cells transformed 
with a recombinant expression construct comprising a nucleic acid encoding any of 
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the cofactors according to the invention. Such a recombinant expression construct 
can also be comprised of a vector that is a replicable DNA construct. 

Amplification vectors do not require expression control domains. Ail that is needed is 
the ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants. See, Sambrook et al., 
Molecular Cloning: A Laboratory Manual (2nd Edition, Cold Spring Harbor Press, 
New York, 1989). 

An expression vector comprises a polynucleotide operatively linked to a prokaryotic 
promoter. Alternatively, an expression vector is a polynucleotide operatively linked to 
an enhancer-promoter that is a eukaryotic promoter, and the expression vector 
further has a polyadenylation signal that is positioned 3* of the carboxy-terminal 
amino acid and within a transcriptional unit of the encoded polypeptide. A promoter is 
a region of a DNA molecule typically within about 500 nucleotide pairs in front of 
(upstream of) the point at which transcription begins (i.e., a transcription start site). In 
general, a vector contains a replicon and control sequences which are derived from 
species compatible with the host cell. The vector ordinarily carries a replication site, 
as well as marking sequences which are capable of providing phenotypic selection in 
transformed cells. 

Another type of discrete transcription regulatory sequence element is an enhancer. 
An enhancer provides specificity of time, location and expression level for a particular, 
encoding region (e.g., gene). A major function of an enhancer is to increase the level 
of transcription of a coding sequence in a cell. 

As used herein, the phrase "enhancer-promoter" means a composite unit that 
contains both enhancer and promoter elements. An enhancer-promoter is operatively 
linked to a coding sequence that encodes at least one gene product. 

An enhancer-promoter used in a vector construct of the present invention may be 
any enhancer-promoter that drives expression in a prokaryotic or eukaryotic cell to be 
transformed/transfected. 
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A coding sequence of an expression vector is operatively linked to a transcription 
terminating region. RNA polymerase transcribes an encoding DNA sequence through 
a site where polyadenylation occurs. 

An expression vector that comprises a polynucleotide that encodes polypeptides of 
the cofactors. Such a polynucleotide is meant to include a sequence of nucleotide 
bases encoding a CF polypeptide sufficient in length to distinguish said segment from 
a polynucleotide segment encoding a lion- cofactor polypeptide. 

A polypeptide of the invention may also encode biologically functional polypeptides or 
peptides which have variant amino acid sequences, such as with changes selected 
based on considerations such as the relative hydropathic score of the amino acids 
being exchanged. 

Thesevariant sequences are those isolated from natural sources or induced in the 
sequences disclosed herein using a mutagenic procedure such as site-directed 
mutagenesis. 

Furthermore, an expression vector of the present invention may contain regulatory 
elements for optimized translation of the polypeptide in prokaryotic or eukaryotic 
systems. These sequences are operatively located around the transcription start site 
and are most likely similar to ribosome recognition sites like prokaryotic ribosome 
binding sites (RBS) or eukaryotic Kozak sequences as known in the art (Kozak M., 
Initiation of translation in prokaryotes and eukaryotes. Gene 234, 187-208 (1999)). 

An expression vector of the present invention is useful both as a means for preparing 
quantities of the CFs 1 polypeptide-encoding DNA itself, and as a means for preparing 
the encoded CFs* polypeptide and peptides. It is contemplated that where cofactor 
polypeptides of the invention are made by recombinant means, one may employ 
either prokaryotic or eukaryotic expression vectors as shuttle systems. 

Where expression of recombinant CF1, CF2, CF3, CF4 or CF30 polypeptide is 
desired and a eukaryotic host is contemplated, it Is most desirable to employ a vector 
such as a plasmid, that incorporates a eukaryotic origin of replication. Additionally, for 
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the purposes of expression in eukaryotic systems, one desires to position the 
cofactor encoding sequence or if desired parts thereof adjacent to and under the 
control of an effective eukaryotic promoter. To bring a coding sequence under control 
of a promoter, whether it is eukaryotic or prokaryotic, what is generally needed is to 
position the 5' end of the translation initiation side of the proper translational reading 
frame of the polypeptide between about 1 and about 2000 nucleotides 3' of or 
downstream with respect to the promoter chosen. 

Furthermore, where eukaryotic expression is anticipated, one would typically desire 
to incorporate into the transcriptional unit which includes the CF polypeptide, an 
appropriate polyadenylation side. 

The invention provides homogeneous compositions of mammalian cofactor 
polypeptides produced by transformed prokaryotic or eukaryotic cells as provided 
herein. Such homogeneous compositions are intended to be comprised of 
mammalian cofactor protein that comprises at least 90% of the protein in such 
homogenous composition. The invention also provides membrane preparation from 
cells expressing the mammalian cofactors polypeptides as the result of 
transformation with a recombinant expression construct, as described here. 

Within the scope of the present invention the terms recombinant protein or coding 
sequence both also include tagged versions of the proteins depicted in SEQ ID NO. 
3, SEQ ID NO. 6, SEQ ID NO. 9, SEQ ID NO. 12 and SEQ ID NO. 30 and/or and 
fusion proteins of said proteins with any other recombinant protein. Tagged versions 
here means that small epitopes of 3-20 amino acids are added to the original protein 
by extending the coding sequence either at the 5'or the 3'terminus leading to N- 
terminal or C-terminal extended proteins respectively, or that such small epitopes are 
included elsewhere in the protein. The same applies for fusion proteins where the 
added sequences are coding for longer proteins, varying between 2 and 100 kDa. 
Tags and fusion proteins are usually used to facilitate purification of recombinant 
proteins by specific antibodies or affinity matrices or to increase solubility of 
recombinant proteins within the expression host. Fusion proteins are also of major 
use as essential parts of yeast two hybrid screens for interaction partners of 
recombinant proteins. 
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Tags used in the scope of the present invention may include but are not limited to the 
following: EEF (alpha Tubulin), B-tag (QYPALT), E tag (GAPVPYPDPLEPR) c-myc 
Tag (EQKLISEEDL), Flag epitope (DYKDDDDK, HA tag (YPYDVPDYA), 6 or 10 x 
His Tag, HSV ( QPELAPEDPED), Pk-Tag (GKPIPNPLLGLDST), protein C 
(EDQVDPRLIDGK), 17 (MASMTGGQQMG), VSV-G (YTDIEMNRLGK), Fusion 
proteines may include Thioredoxin, Glutathiontransferase (GST), Maltose binding 
Protein (MBP), Cellulose Binding protein, calmodulin binding protein, chitin binding 
, 'protein, ubiquitin, the Fc part of Immunoglobulins, and thelgGfbinding domain of 

Staphylococcus aureus protein A. These examples of course are illustrative and not 

/ • 

limiting. 

^For expression of recombinant proteins in living cells or organisms, vector constructs 
harboring recombinant cofactors as set forth in SEQ ID NO. 1, SEQ ID NO. 4, SEQ 
ID NO. 7, SEQ ID NO. 10 and SEQ ID NO. 28,are transformed or transfected into 
appropriate host cells. Preferably, a recombinant host cell of the present invention is 
transfected with a polynucleotide of SEQ ID NO. 1, SEQ ID NO. 4, SEQ ID NO. 7, 
SEQ ID NO. 10 and SEQ ID NO. 28. 

Means of transforming or transfecting cells with exogenous polynucleotide such as 
DNA molecules are well known in the art and include techniques such as calcium- 
phosphate- or DEAE-dextran-mediated transfection, protoplast fusion, 
electroporatipn, liposome mediated transfection, direct microinjection and virus 
infection (Sambrook et al., 1 989). 

The most frequently applied technique for transformation of prokaryotic cells is 
transformation of bacterial cells after treatment with Calciumchloride to increase 
permeability (Dagert & Ehrlich, 1979), but a variety of other methods is also available 
for one skilled in the art. 

The most widely used method for transfection of eukaryotic cells is transfection 
mediated by either calcium phosphate or DEAE-dextran. Although the mechanism 
remains obscure, it is believed that the transfected DNA enters the cytoplasm of the 
cell by endocytosis and is transported to the nucleus. Depending on the cell type, up 
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to 90% of a population of cultured cells may be transfected at any one time. Because 
of its high efficiency, transfection mediated by calcium phosphate or DEAE-dextran is 
the method of choice for studies requiring transient expression of the foreign nucleic 
acid in large numbers of cells. Calcium phosphate-mediated transfection is also used 
to establish cell lines that integrate copies of the foreign DNA, which are usually 
arranged in head-to-tail tandem arrays into the host cell genome. 

In the protoplast fusion method, protoplasts derived from bacteria carrying high 
numbers of copies of a plasmid of interest are mixed directly with cultured 
mammalian cells. After fusion of the cell membranes (usually with polyethylene 
glycol), the contents of the bacterium are delivered into the cytoplasm of the 
mammalian cells and the plasmid DNA is transported to the nucleus. Protoplast 
fusion is not as efficient as transfection for. many of the cell lines that are commonly 
used for transient expression assays, but it is useful for cell lines in which 
endocytosis of DNA occurs inefficiently. Protoplast fusion frequently yields multiple 
copies of the plasmid DNA tandemly integrated into the host chromosome. 

The application of brief, high-voltage electric pulses to a variety of mammalian and 
plant cells leads to the formation of nanometer-sized pores in the plasma membrane. 
DNA is taken directly into the cell cytoplasm either through these pores or as a 
consequence of the redistribution of membrane components that accompanies 
closure of the pores. Electroporation may be extremely efficient and may be used 
both for transient expression of cloned genes and for establishment of cell lines that 
carry integrated copies of the gene of interest. Electroporation, in contrast to calcium 
. phosphate-mediated transfection and protoplast fusion, frequently gives rise to cell 
lines that carry one, or at most a few, integrated copies of the foreign DNA. 

Liposome transfection involves encapsulation of DNA and RNA within liposomes, 
followed by fusion of the liposomes with the cell membrane. The mechanism of how 
DNA is delivered into the cell is unclear but transfection efficiencies may be as high 
as 90%. 

Direct microinjection of a DNA molecule into nuclei has the advantage of not 
exposing DNA to cellular compartments such as low-pH endosomes. Microinjection 
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is therefore used primarily as a method to establish lines of cells that carry integrated 
copies of the DNA of interest. ^ 

The use of adenovirus as a vector for cell transaction is well known in the art. 
Adenovirus vector-mediated cell transfection has been reported for various cells 
(Stratford-Perricaudet et al., 1992). 

A transfected cell may be prokaryotic or eukaryotic, transfection.may be transient or 
stable. Where it is of interest to produce a full length humamCF;1 ,;CF2, CF3, CF4 or 
CF 44 protein, cultured mammalian or human cells are of particular interest. 

In another aspect, the recombinant host cells of the present invention are prokaryotic 
host cells. In addition to prokaryotes, eukaryotic microbes, such as yeast may also be 
,used illustrative examples for suitable cells and organisms for expression of 
recombinant proteins are belonging to but not limited to the following examples: 
Insect cells, such as Drosophila Sf21, SF9 cells or others, Expression strains of 
Escherichia coli, such as XL1 blue, BRL21 , M15, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Hanserilua polymorpha and Pichia pastoris strains, 
immortalized mammalian cell lines such as AtT-20, VERO and HeLa cells, Chinese 
hamster ovary (CHO) cell lines, and W138, BHK, COSM6, COS-7, 293 and MDCK 
cells, BHK-21 cells, Att 20HeLa cells, HeK'294, T47 D cells and others. 

Expression of recombinant proteins within the scope of this invention can also be* 
performed in vitro. This may occur by a two step procedure, thereby producing first 
mRNA by in vitro transcription of an apt polynucleotide construct followed by in vitro 
translation with convenient cellular extracts. These cellular extracts may be 
reticulocyte lysates but are not limited to this type. In vitro transcription may be 
performed by T7 or SP6 DNA polymerase or any other RNA polymerase which can 
recognize per se or with the help of accessory factors the promoter sequence 
contained in the recombinant DNA construct of choice. Alternatively one of the 
recently made available one step coupled transkription/translation systems may be 
used for in vitro translation of DNA coding for the proteins of this invention. One 
illustrative but not limiting example for such a system is the TNT® T7 Quick System 
by Promega. 
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Expression of recombinant proteins in transfected cells may occur constitutively or 
upon induction. Procedures depend on the Cell/vector combination used and are well 
known in the art. 

In all cases, transfected cells are maintained for a period of time sufficient for 
expression of the recombinant cofactor proteins accordingto the invention. A suitable 
maintenance time depends strongly on the cell type and organism used and is easily 
ascertainable by one skilled in the art. Typically, maintenance time Js from about 2 
hours to about 14 days. For the same reasons and forsake of protein stability and 
solubility incubation temperatures during maintenance time may vary from 20°C to 42 
°C. 

Recombinant proteins are recovered or collected either from the transfected ceils or 
the medium in which those cells are cultured. Recovery comprises cell disruption, 
isolation and purification of the recombinant protein. Isolation and purification 
techniques for polypeptides are well-known in the art and include such procedures as 
precipitation, filtration, chromatography, electrophoresis and the like. 

In a preferred embodiment, purification includes but is not limited to affinity 
purification of tagged or nontagged recombinant proteins. This is a well established 
robust technique easily adapted to any tagged protein by one skilled in the art. For 
affinity purification of tagged proteines, small molecules such as gluthathione, * 
maltose or chitin, specific proteins such as the IgG binding dorhain of Staphylococcus 
aureus protein A, antibodies or specific chelates which bind with high affinity to the 
tag of the recombinant protein are employed. For affinity purification of non-tagged 
proteins specific monoclonal or polyclonal antibodies, which were raised against said 
protein, can be used. Alternatively immobilized specific interactors of said protein 
may be employed for affinity purification. Interactors include native or recombinant 
proteins as well as native or artificial specific low molecular weight ligands. 

CHEMICAL SYNTHESIS OF THE POLYPEPTIDE ACCORDING TO THE 
INVENTION: 
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Alternatively, the protein itself may be produced using chemical methods to 
synthesize any of the amino acid sequences according to the invention (SEQ ID No: 
SEQ ID NO. 3, SEQ ID NO. 6, SEQ ID NO. 9, SEQ ID NO. 12 and/or SEQ ID NO. 
30) or that is encoded by the nucleotide sequences according to the invention (SEQ 
ID NO. 1 1 SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 10 and/or SEQ ID NO. 28) or a 
portion thereof. For example, peptide synthesis can be performed using conventional 
Merrifield solid phase f-Moc or t-Boc chemistry or various solid-phase techniques 
(Roberge, J. Y. etal. (1995) Science 269: 202-204) and automated synthesis- may be 
achieved, for example, using the ABI 431 A Peptide Synthesizer. (Perkin Elmer). The 
newly synthesized peptjde(s) may be substantially purified by preparative high 
performance liquid chromatography (e.g., Greighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.). The composition 
of the synthetic peptides may be confirmed by amino acid analysis or sequencing 
(e.g., the Edman degradation procedure; Creighton, supra). Additionally, the amino 
acid sequences according to the invention, i.e. SEQ ID NO. 3, SEQ ID NO. 6, SEQ 
ID NO. 9, SEQ ID NO. 12 and/or SEQ ID NO. 30 or the sequence that is encoded by 
SEQ ID NO. 1, SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 10 and/or SEQ ID NO. 28 
or any part thereof, may be altered during direct synthesis and/or combined using 
chemical methods with sequences from other proteins, or any part thereof, to 
produce a variant polypeptide. 

COMPLEXES OF THE COFACTORS ACCORDING TO THE INVENTION WITH 
OTHER POLYPETIDES 

As outlined above CFs all bind the pregnane X receptor in vivo. In a preferred 
embodiment of the invention the CFs are complexed with this polypeptide or a 
portion thereof as disclosed in SEQ ID NO. 15 and/or 18. Such complexes are 
particular suited for all forms of binding or screening assays (see also below). Thus, 
in a preferred embodiment of the invention such assays are performed with 
complexes of the pregnane X receptor associated with one or more of the CF 
cofactors. v 

Such a complex may additionally comprise other cofactors such as RXR. In one 
embodiment of the invention a heterotrimeric complex of PXR, RXR and CF1 , CF2, 
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CF3, or CF4 is claimed. RXR herein refers equally to the alpha, beta or gamma form 
as encoded by SEQ ID NOs. 19, 22 and 25 or depicted in SEQ ID NOs. 21, 24 and 
27 or any portion thereof. 

Such heterotrimers and multimers may be used in binding and screening assays as 
outlined below. 

In one embodiment of the invention the entire CF polypeptide is part of the complex 
but only a portion, e.g. a truncated fragment of the other polypeptide;(PXR or RXR) is 
part of the complex. 

SCREENING ASSAYS 

In still a further embodiment, the present invention concerns a method for identifying 
new inhibitory or stimulatory substances of the cofactors according to the invention, 
these substances may be termed as "candidate substances". It is contemplated that 
this screening technique proves useful in the general identification of compounds that 
serve the purpose of Inhibiting or stimulating cofactor activity. f 

In one embodiment of the invention the following substances are disclosed as 
interactors of the PXR cofactor complexes: 

Steroids: dexamethasone-t-butylacetate, RU486, progesterone, 17-alpha- 
hydroxyprogesterone, 1,16-alpha dlmethylpregnenolone, 17-alpha- 
hydroxypregnenonlone, pregnenolone, 5beta-pregnane-3,20-dione, pregnenonlone- 
16-carbonitrile, 5beta-pregnane-3,20-dione f androstanol, corticosterone, 
dehydroepiandrosterone, dihydroxytestosterone, estradiol, Cortisol, cortisone, 
dihydroxytestosterone. 

Other substances: transnonachlor, chiordane, spironolactone, cyproterone acetate, 
rifampicin, nefipine, diethylstilbestrol, coumesterol, clotrimazole, lovastatin, 
phenoarbltal, pthalicacid, nonylphenol, 1,4-bis(2-(3 t 5-dichloropyridyloxy1))benzene, 
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One may use the screening method to identify such substances which activate the 
transactivation function of PXR, and thus lead to elevated expression of CYP3A 
genes. In turn, this will alter the ability of an individual treated with the substance to 
degrade xenobiotics, including drugs. ; 

This also includes the use of heteromultimeric complexes of the cofactor protein with 
other proteins, such as the pregnane x receptor protein, or any other binding partner. 

Accordingly, in screening assays to identify pharmaceuticals agents which affect 
cofactor acitivity, it is proposed that compounds isolated from natural sources, such 
as fungal extracts, plant extracts, bacterial extracts, higher eukaryotic cell extracts, or 
even extracts from animal sources, or marine, forest or soil samples, may be 
assayed for the presence of potentially useful pharmaceutical agents. 

It will be understood that that the pharmaceutical agents to be screened can also be 
derived from chemical compositions or man-made compounds. The candidate 
substances can could also include monoclonal or polyclonal antibodies, peptides or 
proteins, such as those derived from recombinant DNA technology or by other 
means, including chemical peptide synthesis. The active compounds may include 
fragments or parts or derivatives of naturally-occurring compounds or may be only 
found as active combinations of known compounds which are otherwise inactive. We 
anticipate that such screens will in some cases lead to the isolation of agonists of 
nuclear receptors or cofactors, In other cases to the isolation of antagonists. In other 
instances, substances will be identified that have mixed agonistic and antagonistic 
effects, or affect nuclear receptors or cofactors in any other way. 

In another embodiment, the invention concerns the isolation of substance inhibiting 
the interaction of the cofactor protein and the pregnane x receptor. Such substances 
are useful for the development of drugs against diseases such as metabolic 
disorders, immunological indications, hormonal dysfunctions and/or neurosystemic 
diseases ad diseases related to a different ability to degrade xenobiotic substances 
or related to defects in steroid homeostasis. Substances disrupting the interactions 
may be isolated by a variety of screening methods including the two hybrid system or 
the reverse two hybrid system (Lenna C.A. and .Hannink, M. 1996, Nucl. Acids Res. 
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24: 3341^3347), or any variation of cellular or cell free assays as described in this 
invention, as is obvious to anyone skilled in the art. 

In an important embodiment of the invention, the binding of the cofactor protein and 
the pregnane x receptor can be used to monitor the binding of a substance to one of 
the binding partners. The substance, which can be a small molecule such as a ligahd 
to a nuclear receptor will lead to a change in the allosteric conformation of the 
binding protein which in consequence leads to a loss of the Interaction of the two 
proteins. Using this effect of ligand-dependent protein-protein interactions one can 
design assays where the protein-protein interaction serves as a surrogate read-out 
for the binding of one of the proteins to small molecule ligand. Any assay method 
which is useful for the measurement of protein-protein interactions can be used for 
such an indirect assay. Such assay methods are well known in the art and include 
the methods described in this patent under "Cell free assays" and "Cell based 
assays". In a preferred embodiment, this assay will measure the binding of 
; substances to PXR, resulting in an effect on the interaction of PXR with the cofactor. 



CELL BASED ASSAYS 



To identify a candidate substance capable of Influencing the cofactor protein activity, 
one first obtains a recombinant cell line. One designs the cell line in such a way that 
the activity of the cofactor leads to the expression of a protein which has an easily 
detectable phenotype ( a reporter), such as luciferase, fluorescent proteins such as 
green or red fluorescent protein, beta-galactosidase, alpha-galactosidase, beta- 
lactamase, chloramphenicol-acetyl-transferase, beta-glucuronidase, or any protein 
which can be detected by a secondary reagent such as an antibody. 

Methods for detecting proteins using antibodies, such as ELISA assays, are well 
known to those skilled in the art. 

Here, the amount of reporter protein present reflects the activity of thexofactor. This 
recombinant cell line is then screened for the effect of substances on the expression 
of the reporters, thus measuring the effect of these substances on the activity of the 
cofactor. These substances can be derived from natural sources, such as fungal 
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extracts, plant extracts, bacterial extracts, higher eukaryotic cell extracts, or even 
extracts from animal sources, or marine, forest or soil samples, may be assayed for 
the presence of potentially useful pharmaceutical agents. It will be understood that 
that the pharmaceutical agents to be screened may be derived from chemical 
compositions or man-made compounds. 

The candidate substances can also include monoclonal or polyclonal antibodies, 
peptides or proteins, such as those derived from recombinant DNA technology or by 
other means, including chemical peptide synthesis. The active compounds may 1 
include fragments or parts or derivatives of naturally-occurring compounds or may be 
only found as active combinations of known compounds which are otherwise 
inactive. ' 

In general the assay can be performed by firstly bringing a suitable cell containing a 
reporter gene which transcription is influenced by the cofactors activity in contact with 
a compound and secondly monitoring the expression of the reporter gene to evaluate 
the effect of the compound on the activity of the cof actor. 

In other embodiments of the invention assays are included where measuring the 
activity of di- or multimeric complexes of the cofactor and other proteins such as 
PXR or RXR. Further included are assays aiming at the identification of compounds 
which specifically influence only the monomelic, homodimeric or homomultimeric 
form of the cofactor, or influencing only multimeric forms of the cofactor. Such assays 
include measuring the effect of a compound on the cofactor in the absence of a 
binding partner, and measuring the effect of a compound on the cofactor in the 
presence of a binding partner, such as PXR. One skilled in the art will find numerous 
more assays which are equally covered by the invention. 

A cell line where the activity of PXR or any other nuclear receptor determines the 
expression of a reporter can be obtained by generating an artificial promoter 
upstream of the reporter gene, which contains preferably multiple copies of HREs to 
which PXR or any other nuclear receptor binds. 
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Furthermore, transgenic animals described in the invention can be used to derive cell 
lines useful for cellular screening assays. 

Cell lines useful for such an assay include many different kinds of cells, including 
prokaryotic, animal, fungal, plant and human cells. Yeast cells can be used in this 
assay, including Saccharomyces cerevisiae and Schizosaccharomyces pombe cells. 

One way of building cellular assays is by measuring the effect of compounds is the 
use of the two hybrid system (see for example see, for example, U.S. Pat. No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 
268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. 
(1993) Oncogene 8:1693-1696; PCT Publication No. WO 94/10300, and U.S. Pat. 
No. 5;667,973), or possible variants of the basic two hybrid system as discussed e.g 
in Vidal M, Legrain P, Nucleic Acids Res 1999 Feb 15;27(4):919-29. Briefly, the two 
hybrid assay relies on reconstituting in vivo a functional transcriptional activator 
protein from two separate fusion proteins. In particular, the method makes use of 
chimeric genes which express hybrid proteins: To illustrate, a first hybrid gene 
comprises the coding sequence for a DNA-binding domain of a transcriptional 
activator fused in frame to the coding sequence for a cofactor. The second hybrid 
protein encodes a transcriptional activation domain fused in frame to another gene, 
for example PXR. If the cofactor and PXR proteins are able to Interact, they bring into 
close proximity the two domgins of the transcriptional activator. This proximity is 
sufficient to cause transcription of a reporter gene which is operably linked to a 
transcriptional regulatory site responsive to the transcriptional activator, and 
expression of the reporter gene can be detected and used to score for the interaction 
of the cofactor and PXR proteins. Suitable host cells for such assays include yeast 
cells, but also mammalian cells or bacterial cells. 

In such assays, one primarily measures the effect of a compound on a given 
interaction involving the CF cofactors and a binding protein. In a preferred 
embodiment of the invention systems using other hosts such as prokaryotes as E 
coli t or eukaryotic mammalian cells are described. 
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Two hybrid systems using hybrid protein fusions with other proteins than transcription 
factors, including enzymes such as beta-galactosidase or dihydrofolate reductase 
may also be applied. These assays are useful both to monitor the effect of a 
compound, including peptides, proteins or nucleic acids on an interaction of a 
cofactor with a given binding partner, as well as to identify novel proteins or nucleic 
acids interacting with the cofactor. 

CELL-FREE ASSAYS 

Recombinant forms of the polypeptides according to SEQ ID NO. 3, SEQ ID NO. 6, 
SEQ ID NO, 9, SEQ . ID NO. 12 and/or SEQ ID NO. 30 can be used in cell-free 
screening assays aiming at the isolation of compounds affecting the activity of 
cofactors. In such an assay, the cofactor polypeptides are brought into contact with a 
substance to test if the substance has an effect on the activity of the cofactors. 

The detection of an interaction between an agent and a cofactor may be 
accomplished through techniques well-known in the art. These techniques include 
but are not limited to centrifugation, chromatography, electrophoresis and 
spectroscopy. The use of isotopically labeled reagents in conjunction with these 
techniques or alone is also contemplated. Commonly used radioactive isotopes 
include 3 H, 14 C, . ^Na, 32 P, 33 P, ^S, 45 Ca, 60 Co, 125 l, and 131 1. Commonly used 
stable isotopes include 13 C, 15 N, 18 0. 

For example, if an agent binds to any of the cofactors of the present invention, the 
binding may be detected by using radiolabeled agent or radiolabeled cofactor. Briefly, 
if radiolabeled agent or radiolabeled cofactor is utilized, the agent-cofactor complex 
may be detected by liquid scintillation or by exposure to x-ray film or phosho-imaging 
devices. 

One way to screen for substances affecting cofactor activity is to measure the effect 
of the substance on the binding affinity of the cofactor to other proteins or molecules, 
such as activators or repressors, DNA, RNA, other proteins, antibodies peptides or 
other substances, including chemical compounds known to affect receptor activity or 
to a nuclear receptor itself. Assays measuring the binding of a protein to a ligand are 
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well known in the art, such as ELISA assays, FRET assays, bandshift assays, 
plasmon-resonance based assays, scintilllation proximity assays, fluorescence 
polarization assays, alpha screen assays. 

In one example, a mixture containing a cofactor polypeptide, effector and candidate 
substance is allowed to incubate. The unbound effector is separable from any 
effector/cofactor complex so formed. One then simply measures the amount of each 
(e.g., versus a control to which no candidate substance has been added). This 
measurement may be made at various time points where velocity data is desired. 
From this, one determines the ability of the candidate substance to alter or modify the 
function of the cofactor. 

Numerous techniques are known for separating the effector from effector/cofactor 
complex, and all such methods are intended to fall within the scope of the invention. 
This includes the use of thin layer chromatographic methods (TLC), HPLC, 
spectrophotometry, gas chromatographic/mass spectrophotometry or NMR 
analyses. Another method of separation is to immobilize one of the binding partners 
on a solid support, and to wash away any unbound material. It is contemplated that 
any such technique may be employed so long as it is capable of differentiating 
between the effector and complex, and may be used to determine enzymatic function 
such as by identifying or quantifying the substrate and product. 

A screening assay in which candidate agent binding of cofactors is analysed can 
include a number of conditions. These conditions include but are not limited to pH, 
temperature, tonicity, the presence of relevant other proteins, and relevant 
modifications to the polypeptide such as glycosylation or lipidation. It is contemplated 
that the cofactors can be expressed and utilized in a prokaryotic or eukaryotic cell. 
The host cell expressing the cofactors can be used whole or the cofactor can be 
isolated from the host cell. The cofactor can be membrane bound in the membrane of 
the host cell or it can be free in the cytosol of the host cell. The host cell can also be 
fractionated into sub-cellular fractions where the cofactor can be found. For example, 
cells expressing the cofactor can be fractionated into the nuclei, the endoplasmic 
reticulum, vesicles, or the membrane surfaces of the cell. 
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pH is preferably from about a value of 6.0 to a value of about 8.0, more preferably 
from about a value of about 6.8 to a value of about 7.8, and most preferably, about 
7.4. In a preferred embodiment, temperature is from about 20°C degrees to about 
50°C degrees more preferably, from about 30°C degrees to about 40°C degrees and 
even more preferably about 37°C degrees. Osmolality is preferably from about 5 
milliosmols per liter (mosm/L) to about 400 mosm/l, and more preferably, from about 
200 milliosmols per liter to about 400 mosm/l and, even more preferably from, about 
290 mosm/L to about 310 mosm/L. The presence of further cof actors or other 
proteins can be required for the proper functioning of the cpfactors according to ! the 
invention. Typical chemical cofactors include sodium, potassium, calcium, 
magnesium, and chloride. In addition, small, non-peptide molecules, known as 
prosthetic groups may also be required. Other biological conditions needed for 
cofactor function are well-known in the art. 

It is well-known in the art that proteins can be reconstituted in artificial membranes, 
vesicles or liposomes. (Danboldt et al.,1990). The present invention contemplates 
that the cofactor can be incorporated into artificial membranes, vesicles or liposomes. 
The reconstituted cofactor can be utilized in screening assays. 

It is further contemplated that a cofactor of the present Invention can be coupled to a 
solid support, e.g., to agarose beads, polyacrylamide beads, polyacrylic, sepharose 
beads or other solid matrices capable of being coupled to polypeptides. Well-known 
coupling agents include cyanogen bromide (CNBr), carbonyldiimidazole, tosyl 
chloride, diaminopimelimidate, and glutaraldehyde. 

In a typical screening assay for identifying candidate substances, one employs the 
same recombinant expression host as the starting source for obtaining the cofactor 
polypeptide, generally prepared in the form of a crude homogenate. Recombinant 
cells expressing the cofactor are washed and homogenized to prepare a crude 
polypeptide homogenate in a desirable buffer such as disclosed herein. In a typical 
assay, an amount of polypeptide from the cell homogenate, is placed into a small 
volume of an appropriate assay buffer at an appropriate pH. Candidate substances, 
such as agonists and antagonists, are added to the admixture in convenient 
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concentrations and the interaction between the candidate substance and the cofactor 
polypeptide is monitored (see also Fig. 1). 

Where one uses an appropriate known substrate for the cofactors, one can, in the 
foregoing manner, obtain a baseline activity for the recombinant^ produced 
cofactors. Then, to test for inhibitors or modifiers of the cofactor function, one can 
incorporate into the admixture a candidate substance whose effect on the cofactor is 
unknown. By comparing reactions which are carried out in the presence or absence 
of the candidate substance, one can then obtain information regarding the effect of 
the candidate substance on the normal function of the cofactor. 

Accordingly, this aspect of the present invention will provide those of skill in the art 
with methodology that allows for the identification of candidate-substances having the 
ability to modify the action of cofactor polypeptides in one or more manners. 

Additionally, screening assays for the testing of candidate substances are designed 
to allow the determination of structure-activity relationships of agonists or antagonists 
with the cofactors, e.g., comparisons of binding between naturally-occurring 
hormones or other substances capable of interactingwith or otherwise modulating 
the cofactor; or comparison of the activity caused by the binding of such molecules to 
the cofactor. 

In certain aspects, the polypeptides of the invention are crystallized in order to carry 
out x-ray crystallographic. studies as a means of evaluating interactions with . 
candidate substances or other molecules with the cofactor polypeptide. For instance, 
the purified recombinant polypeptides of the invention, i.e. of the cofactors according 
to the invention, when crystallized in a suitable form, are amenable to detection of 
intra-molecular interactions by x-ray crystallography. In another aspect, the structure 
of the polypeptides can be determined using nuclear magnetic resonance. 



PHARMACEUTICAL COMPOSITION: 



This invention provides a pharmaceutical composition comprising an effective 
amount of an agonist or antagonist drug identified by the method described herein 
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and a pharmaceutically acceptable carrier. Such drugs and carrier can be 
administered by various routes, for example oral, subcutaneous, intramuscular, 
intravenous or Intracerebral. The preferred route of administration would be oral at 
daily doses of about 0.01 -1 00 mg/kg. ' 

This invention provides a method of treating metabolic disorders, immunological 
indications, hormonal dysfunctions, neurosystemic diseases wherein the abnormality 
is improved by altering the activity of the cofactor thereby influencing the binding 
affinity of the cofactor to PXR, which could be useful for the treatment of disturbances 
in steroid homeostasis. Similarly, the invention also provides methods for treating 
diseases and conditions resulting from metabolic disorders, immunological 
indications, hormonal dysfunctions, neurosystemic diseases/or other diseases, 
which method comprises administering an effective amount of an agonist- or 
antagonist containing pharmaceutical composition described above. 

TRANSFORMATION OF CELLS AND DRUG SCREENING : 

The recombinant expression constructs of the present invention are useful in 
molecular biology to transform cells which do not ordinarily express the CFs to 
express these cofactors upon transformation. 

Such cells are useful as intermediates for making cellular preparations useful for 
cofactor binding assays, which are in turn useful for drug screening. 

The recombinant expression constructs of the present invention are also useful in 
gene therapy. Cloned genes of the present invention, or fragments thereof, may also 
be used in gene therapy carried out by homologous recombination or site-directed 
mutagenesis. See generally Thomas & Capecchi, Cell 51, 503-512 (1987); Bertling, 
Bioscience Reports 7, 107-1 12 (1987); Smithies et al., Nature 317, 230-234 (1985). 

Oligonucleotides of the present invention are useful as diagnostic tools for probing 
cofactor gene expression in tissues. For example, tissues are probed in situ with 
oligonucleotide probes carrying detectable groups by conventional autoradiographic 
techniques, as explained in greater detail in the Examples below, to investigate 
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native expression of this cofactor or pathological conditions relating thereto. Further, 
chromosomes can be probed to investigate the presence or absence of the CF 
genes, and potential pathological conditions related thereto, as also illustrated by the 
Examples below. Probes according to the invention should generally be at least 
about 15 nucleotides in length to prevent binding to random sequences, but, under 
the appropriate circumstances may be smaller. 

ANTIBODIES AGAINST THE COFACTOR PROTEIN OR POLYPEPTIDE 

Another aspect of the invention includes antibodies specifically reactive with the 
proteins of any parts of the proteins according to the invention (SEQ ID NO. 3, SEQ 
ID NO. 6, SEQ ID NO. 9, SEQ ID NO. 12 and/or SEQ ID NO. 30) and or polypeptides 
encoded by the nucleotide sequences of the cofactors or their complements (SEQ ID 
NO. 1 , SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 10, SEQ ID NO. 2, SEQ ID NO. 5, 
SEQ ID NO. 8, SEQ ID NO. 11, SEQ ID NO. 28 and/or SEQ ID NO. 29). (The term 
„antibody" refers to intact molecules as well as fragments thereof, such as Fa, 
F(ab).sub.2, and Fv, which are capable of binding the epitopic determinant.) By using 
immunogens derived from the polypeptide according to the invention and/or encoded 
by the nucleic acids according to the invention, anti-protein/anti-peptide antiserum or 
monoclonal antibodies can be made by standard protocols (E. Howell & D. Lane. 
Antibodies: A Laboratory Manual. Cold Spring Harbor Laboratory (1988)). 

A polyclonal antibody Is prepared by immunizing a mammal, such as a mouse, a 
hamster or rabbit with an immunogenic form of the cofactors according to the 
invention depending on which of these are desired) of the present invention, and 
collecting antisera from that immunized animal. Because of the relatively large blood 
volume of rabbits, a rabbit is a preferred choice for production of polyclonal 
antibodies. 

As an immunizing antigen, fusion proteins, intact polypeptides or fragments 
containing small peptides of interest can be used. They can be derived by expression 
from a cDNA transfected in a host cell with subsequent recovering of the 
protein/peptide or peptides can be synthesized chemically (e.g. oligopeptides with 
10-15 residues in length). Important tools for monitoring the function of the cofactor 
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gene according to the present invention, I.e. encoded by a sequence according to 
SEQ ID NO. 1 , SEQ ID NO. 4, SEQ ID NO. 7, SEQ ID NO. 10 and SEQ ID NO. 28 
are antibodies against various domains of the proteins according to the invention. 

A given polypeptide or polynucleotide may vary in its immunogenicity. It is often 
necessary to couple the immunogen (e.g. the polypeptide) with a carrier. Commonly 
used carriers that are chemically coupled to peptides include bovine serum albumin 
(BSA) and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to 
immunize the animal in the presence of an adjuvant, a non-specific stimulator of the 
immune response in order to enhance immunogenicity. The production of polyclonal 
antibodies is monitored by detection of antibody titers in plasma or serum at various 
time points following immunization. Standard ELISA or other immunoassays can be 
. 'used with the immunogen as antigen to assess the levels of antibodies. When a 
desired level of immunogenicity is obtained, the immunized animal may be bled and 
the serum isolated, stored and purified. 

To produce monoclonal antibodies, antibody-producing cells (e.g. spleen cells) from 
an immunized animal (preferably mouse or rat) are fused by standard somatic cell 
fusion procedures with immortalizing cells such'as myeloma cells to yield hybridoma 
cells. Where the immunized animal is a mouse, a preferred myeloma cell Is the 
murine NS-1 myeloma cell. Such techniques are well known in the art, and include, 
for example, the hybridoma technique (originally developed by Kohler & Milstein. 
Nature 256: 495-497 (1 975)), the human B cell hybridoma technique (Kozbar et al. 
Immunology Today 4:72 (1983)), and the EBV-hybridoma technique to produce 
human monoclonal antibodies (Cole et al. Monoclonal Antibodies and Cancer 
Therapy. Alan R. Liss, Inc. pp. 77-96 (1985)). 

The fused spleen/myeloma cells are cultured in a selective medium to select fused 
spleen/myeloma cells from the parental cells. Fused cells are separated from the 
mixture of non-fused parental ceils, for example/by the addition of agents that block 
the de novo synthesis of nucleotides in the tissue culture media. This culturing 
provides a population of hybridomas from which specific hybridomas are selected. 
Typically, selection of hybridomas is performed by culturing the cells by single-clone 
dilution in microtiter plates, followed by testing the individual clonal supernatants for 
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reactivity with an antigen-polypeptide. The selected clones may then be propagated 
indefinitely to provide the monoclonal antibody in convenient quantity. 

The creation of antibodies which specifically bind the polypeptides according to the 
invention and/or encoded by the nucleotide sequences of the cofactors or their 
complements provides an important utility in immunolocalization studies, and may 
play an important role in the diagnosis and treatment of such diseases and disorders 
as metabolic disorders, immunological indications, hormonal dysfunctions and/or 
neurosystemic diseases. The antibodies may be employed to Jdentifyrtissues, 
organs, and cells which express the cofactors. Antibodies can be used diagnostically 
in immuno-precipitation and immuno-blotting to detect and evaluate cofactor protein 
levels in tissue or from cells in bodily fluid as part of a clinical testing procedure. 

Monoclonal antibodies provided by the present invention are;also produced by 
recombinant genetic methods well known to those of skill in the art, and the present 
invention encompasses antibodies made by such methods that are immunologically 
reactive with an epitope of a mammalian cofactor protein or peptide according to the 
invention. 

The present invention encompasses fragments of the antibody that are 
immunologically reactive with an epitope of a cofactor protein or peptide. Such 
fragments are produced by any number of methods, including but not limited to 
proteolytic cleavage, chemical synthesis or preparation of such fragments by means 
of genetic engineering technology. The present invention also encompasses single- 
chain antibodies that are immunologically reactive with an epitope of a cofactor 
protein or peptide made by methods known to those of skill in the art. 

CHIMERIC ANTIBODIES AND OTHER TYPES. OF ANTIBODIES: 

The invention also includes chimeric antibodies, comprised of light chain and heavy 
chain peptides immunologically reactive to an epitope that is a cofactor protein or 
peptide according to the invention. The chimeric antibodies embodied in the present 
invention include those that are derived from naturally occurring antibodies as well as 
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chimeric antibodies made by means of genetic engineering technology well known to 
those of skill in the art. 

Also included are methods for the generation of antibodies against any , of the group 
comprising the peptides according to SEQ ID NO: 3, SEQ ID NO. 6, SEQ ID NO. 9, 
SEQ ID NO. 12 and/or SEQ ID NO. 30 which rely on the use, of phage display 
systems and related systems, such as described in Hoogenboorn HR, de Bruine AP, 
Hufton SE; Hoet RM, Arends JW, Roovers RC, Immunotechnology 1998 Jun;4(1):1- 
20, and references therein. 

EPITOPES OF THE COFACTORS 

' The present invention also encompasses one or more epitopes of a cofactor protein 
or peptide that is comprised of sequences and/or a conformation of sequences 
present in the cofactor proteins or peptide molecule. These epitopes may be. naturally 
occurring, or may be the result of proteolytic cleavage of the cofactor proteins or 
peptides and isolation of an epitope-containing peptide or may beobtained by 
synthesis of an epitope-containing peptide using a method of genetic engineering 
technology and synthesized by genetically engineered prokaryotjc or eukaryotic cells. 

ANTISENSE OLIGONUCLEOTIDES AGAINST COFACTOR GENE TRANSCRIPTS 

Antisense oligonucleotides are short single stranded DNA or RNA molecules which 
may be used to block the availability of the cofactor messengers). Synthetic 
derivatives of ribonucleotides or desoxyribonucleotides and/or PNAs (see above) are 
equally possible. These are potential candidate agents which may interact with the 
cofactor according to the invention. 

The sequence of an antisense oligonucleotide is at least partially complementary to 
the sequence of the cofactor of interest. The complementarity of the sequence is in 
any case high enough to enable the antisense oligonucleotide to bind to the nucleic 
acid according to the invention or parts thereof (SEQ ID NO. 1 , SEQ ID NO. 4, SEQ 
ID NO. 7, SEQ ID NO. 10 and/or SEQ ID NO. 28) in which the binding of 
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oligonucleotides to the target sequence interfere with the biological function of the 
targeted sequence (Brysch W, Schlingensiepen KH, Design and application of 
antisense oligonucleotides in cell culture,, in vivo, and as therapeutic agents, Cell Mol 
Neurobiol 1994 Oct; 14(5):557-68; Wagner RW, Gene inhibition using antisense 
oligodeoxynucleotides, Nature 1994 Nov 24;372(65G4): 333-5 or Brysch W, Magal E, 
Louis JC, Kunst M, Klinger I, Schlingensiepen R, Schlingensiepen KH Inhibition of 
p185c-erbB-2 proto-oncogene expression by antisense oligodeoxynucleotides down- 
. regulates p185-associated tyrosine-kinase activity and' strongly inhibits mammary 
tumor-cell proliferation, Cancer Gene Ther 1994 Jun;1(2):99-105 or Monia BP, 
Johnston JF, Ecker DJ, Zounes MA, Lima WF, Freier SM Selective inhibition of 
mutant Ha-ras mRNA expression by antisense oligonucleotides, J Biol Chem 1992 
Oct 5;267(28):1 9954-62 or Bertram J, Palfner K, Killian M, Brysch W, 
Schlingensiepen. KH, Hiddemann W, Kneba M, Reversal of multiple drug resistance 
in vitro by phosphorothioate oligonucleotides and ribozymes, Anticancer Drugs 1995 
Feb;6(1):124-34) 

This interference occurs in most instances at the level of translation, /.e. through the 
inhibition of the translational machinery by oligonucleotides that bind to mRNA, 
however, two other mechanisms of interference with a given gene's function by 
oligonucleotides can also be envisioned, (i) the functional interference with the 
transcription of a gene through formation of a triple helix at the level of genomic DNA 
and the interference of oligonucleotides with the function of RNA molecules that are 
executing at least part of their biological function in the untranslated form 
(Kochetkova M, Shannon MF, Triplex-forming oligonucleotides and their use in the 
analysis of gene transcription. Methods Mol Biol 2000;130:189-201 Rainer B. Lanzl , 
Neil J. McKennal, Sergio A. Onatel, Urs Albrecht2, Jiemin Wongl, Sophia Y. Tsail, 
Ming-Oer Tsail , and Bert W . O'Malley A Steroid Receptor Coactivator, SRA, 
Functions as an RNA and Is Present in an SRC-1 Complex Cell, Vol. 97, 17-27, 
April, 1999). 

Antisense oligonucleotides can be conjugated to different other molecules in order to 
deliver them to the cell or tissue expressing any of the cofactor genes. For instance 
the antisense oligonucleotide can be conjugated to a carrier protein (e.g. ferritin) in 
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order to direct the oligonucleotide towards the desired target tissue, i.e. in case of 
ferritin predominantly to the liver. 

Antisense expression constructs are expression vector systems that allow the 
expression - either inducible or uninducible - of a complementary sequence to the 
cofactor sequences according to the invention/The potential possibility of such an 
approach has been demonstrated in many different model systems (von Ruden T, 
Gilboa E, Inhibition, of human T-cell leukemia virus type I replication in primary 
human T cells that express antisense RNA, J Virol 1989 Feb;63(2):677-82; Nemir M, 
Bhattacharyya D, Li X, Singh K, Mukherjee AB, Mukherjee BB, Targeted inhibition of 
osteopontin expression in the mammary gland causes abnormal morphogenesis and 
lactation deficiency, J Biol Chem 2000 Jan 14;275(2):969-76; Ma L, Gauville C, 
Berthois Y, Millot G, Johnson GR, Calvo F Antisense expression for amphiregulin 
suppresses tumorigenicity of a transformed human breast epithelial cell line, 
Oncogene 1999 Nov 11;18(47):6513-20; Refolo LM, Eckman C, Prada CM, Yager D, 
Sambamurti K, Mehta N, Hardy J, Younkin SG, Antisense-induced reduction of 
presenilin 1 expression selectively increases the production of amyloid beta42 in 
transfected cells, J Neurochem 1999 Dec;73(6):2383-8; Buckley NJ, Abogadie FC, 
Brown DA, Dayrell M, Caulfield MP, Delmas P, Haley JE, Use of antisense 
expression plasmids to attenuate G-protein expression in primary neurons, Methods 
Enzymol 2000;314:136-48). 

According to the invention an antisense expression construct can be constructed with 
virtually any expression vector capable of fulfilling at least the basic requirements 
known to thoseskllled in the art. 

In one embodiment of the invention retroviral expression systems or tissue specific 
gene expression systems are preferred. 

Current standard technologies for delivering antisense constructs are performed 
through a conjugation of constructs with liposomes and related, complex-forming 
compounds, which are transferred via electroporation techniques or via particle- 
mediated "gene gun" technologies into the cell. Other techniques may be envisioned 
by one skilled in the art. 
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Microinjection still plays a major role in most gene transfer techniques for the 
generation of germ-line mutants expressing foreign DNA (including antisense RNA 
constructs) and is preferred embodiment of the present invention. 

RIBOZYMES DIRECTED AGAINST CF GENE TRANSCRIPT. 

Ribozymes are either RNA molecules (Gibson SA, Pellenz C.HutchisoniRE, Davey 
FR, Shillitoe EJ, Induction of apoptosis in oral cancer cells by an anti-bcl-2 ribozyme 
delivered by an adenovirus vector, Clin Cancer Res 2000 Jan;6(1):213-22; Folini M, 
Colella G, Villa R, Lualdi S, Daidone MG, Zaffaroni N, Inhibition of Telomerase 
Activity by a Hammerhead Ribozyme Targeting the RNA Component of Telomerase 
in Human Melanoma Cells, J Invest Dermatol 2000 Feb;114(2):259-267; Halatsch 
ME, Schmidt U, Botefur IC, Holland JF, Ohnuma T, Marked inhibition of glioblastoma 
-target cell tumorigenicity in vitro by retrovirus-mediated transfer of a hairpin ribozyme 
against deletion-mutant epidermal growth factor receptor messenger RNA, J 
Neurosurg 2000 Feb;92(2):297-305; Ohmichi T, Kool ET, The virtues of self-binding: 
high sequence specificity for RNA cleavage by self-processed hammerhead 
ribozymes, Nucleic Acids Res 2000 Feb 1;28(3):776-783) or DNA molecules (Li J, 
Zheng W, Kwon AH, Lu Y, In vitro selection and characterization of a highly efficient 
Zn(ll)-dependent RNA-cleaving deoxyribozyme; Nucleic Acids Res 2000 Jan 
15;28(2):481-488) that have catalytic activity. The catalytic activity located in one part 
of the RNA (or DNA) molecule can be "targeted" to a specific sequence of interest by 
fusing the enzymatically active RNA molecule sequence with a short stretch of RNA 
(or DNA) sequence that is complementary to the cofactor gene transcript of interest. 
Such a construct will, when introduced into a cell either physically or via gene 
transfer of a ribozyme expression construct find the corresponding cofactor sequence 
(our sequence of interest or also targeted in RNA) and bind via its sequence-specific 
part to said sequence. The catalytic activity attached to the construct, usually 
associated with a special nucleic acid structure (people distinguish so called 
"hammerhead" structures and "hairpin" structures), will then cleave the targeted RNA. 
The targeted mRNA will be destroyed and cannot be translated efficiently, thus the 
protein encoded by the mRNA derived from cofactor will not be expressed or at least 
will be expressed at significantly reduced amounts. 
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These are potential candidate agents which may interact with the cofactor according 
to the invention. 

In a preferred embodiment the invention covers inducible ribozyme constructs 
(Koizumi M, Soukup GA, Kerr JN, Breaker RR, Allosteric selection of ribozymes that 
respond to the second messengers cGMP and cAMP, Nat Struct Biol 1999 
Nov;6(11):1062-1071). 

In a further preferred embodiment the invention concerns the use of "bivalent" 
ribozymes (multimers of catalytically active nucleic acids) as described in (Hamada 
M, Kuwabara T, Warashina M, Nakayama A, Taira K, Specificity of novel 
allosterically trans- and cis-activated connected maxizymes that are designed to 
suppress BCR-ABL expression FEBS Lett 1999 Nov 12;461(1-2):77-85). 

TRANSGENIC ANIMALS CARRYING THE CF1 , CF2, CF3, CF4 AND/OR CF44 
COFACTOR GENE 

Also provided by the present invention are non-human transgenic animals grown 
from germ cells transformed with a CF1, CF2, CF3, CF4 or CF44 nucleic acid 
sequence according to the invention and that express the cofactor according to the 
invention and offspring and descendants thereof. Also provided are transgenic non- 
human mammals comprising a homologous recombination knockout of the native 
cofactors, as well as transgenic non-human mammals grown from germ cells 
transformed with nucleic acid antisense to the nucleic acids of the invention and 
offspring and descendants thereof. Further included as part of the present Invention 
are non-human transgenic animals in which the native cofactor has been replaced 
with the human ortholog. Of course, offspring and descendants of all of the foregoing 
transgenic animals are also encompassed by the invention. 

Transgenic animals according to the invention can be made using well known 
techniques with the nucleic acids disclosed herein. E.g., Leder et al., U.S. Patent 
Nos.4,736,866 and 5,175,383; Hogan et al. ( Manipulating the Mouse Embryo, A 
Laboratory Manual (Cold Spring Harbor Laboratory (1986)); Capecchi, Scienc 244, 
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1288 (1989); Zimmerand Gruss, Nature 338, 150 (1989); Kuhn et al., Science 269, 
1427 (1995); Katsuki et al., Science 241, 593 (1988); Hasty et al., Nature 350, 243 
. (1991); Stacey et al., Mol. Cell Biol. 14, 1009 (1994); Hanks et al., Science 269, 679 
(1995); and Marx, Science 269, 636 (1995). Such transgenic animals are useful for 
screening for and determining the physiological effects of the cofactor agonists and 
antagonist. 

Consequently, such transgenic animals are useful for developing drugs to regulate 
physiological activities in which the cofactors participate. 

The following Examples are provided for illustrative purposes only and are not 
intended, nor should they be construed, as limiting the invention in any manner. 

MODELLING OF THE STRUCTURE OF CF1 , CF2, CF3, CF4 AND/OR CF44 

In one embodiment of the invention the amino acid sequences of the present 
invention can be used for structural drug design. Aim is to produce structural-analogs 
of biologically active polypeptides of interest or of small molecules with which they 
interact (e.g. agonists, antagonists or inhibitors) in order to design drugs which are, 
for example, more active or stable forms of the polypeptide, or which, for example, 
enhance or interfere with the function of a polypeptide in vivo. In one approach one 
first determines the three-dimensional structure of a protein of interest, i.e. the 
cofactor, by computer-modeling, x-ray crystallography or a combination of both 
approaches. Additional useful information with respect to the structure of a 
polypeptide could also be gained from comparison of the protein sequence of the 
protein of interest with the sequence of related proteins where the structure Is known. 
From the three-dimensional structure, binding sites of potential inhibitors or activators 
can be predicted. It can further be predicted which kinds of molecule might bind 
there. The predicted substances can then be screened to test their effect on the 
activity of the protein and its biological function. 

EXAMPLES 
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EXAMPLE 1 : CLONING AND EXPRESSION OF THE GENES ACCORDING TO 
THE INVENTION 

Construction of suitable vectors containing the desired coding and control sequences 
employs standard ligation and restriction techniques that are well understood in the 
art. Isolated plasmids, DNA sequences, or synthesized oligonucleotides are cleaved, 
tailored, and religated in the form desired. 

Site-specific DNA cleavage is performed by treatment with-the suitabletrestriction 
enzyme (or enzymes) under conditions that are generally understood in the art, and 
the particulars of which are specified by the manufacturer of these commercially 
available restriction enzymes. 

See, e.g., New England Biolabs, Product Catalog. In general, about 1 pg of plasmid 
and/or DNA sequence is cleaved by one unit of enzyme in about 20 pi of buffer 
solution. Often excess of restriction enzyme is used to ensure complete digestion of 
the DNA substrate. Incubation times of about one hour to two hours at about 37°C 
are workable, although variations are tolerable. 

After each incubation, protein is removed by extraction with phenol/chloroform, and 
may be followed by ether extraction. The nucleic acid may be recovered from 
aqueous fractions by precipitation with ethanol. If desired, size separation of the 
cleaved fragments may be performed by polyacrylamide gel or agarose gel 
electrophoresis using standard techniques. A general description of size separations 
is found in Methods in Enzymology 65, 499-560 (1980). 

Transformed host cells are cells which have been transformed ortransfected with 
recombinant expression constructs made using recombinant DNA techniques and 
comprising cofactor encoding sequences. Preferred host cells for transient 
transfection are COS-7 cells. Transformed host cells may ordinarily express one of 
the cofactors CF1, CF2, CF3, CF4 or CF44, but host cells transformed for purposes 
of cloning or amplifying nucleic acid hybridization probe DNA need not express the 
cofactors. When expressed, the cofactor proteins will typically be located in the host 
cell membrane. 
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Cultures of cells derived from multicellular organisms are desirable hosts for 
recombinant nuclear receptor protein synthesis. In principal, any higher eukaryotic 
cell culture is workable, whether from vertebrate or invertebrate culture. However, 
mammalian cells are preferred. Propagation of such cells in cell culture has become 
a routine procedure. See Tissue Culture (Academic Press, Kruse & Patterson, Eds., 
1973). Examples of useful host cell lines are bacteria cells, insect cells, yeast cells, 
human 293 cells, VERO and HeLa cells, LMTK- cells, and WI138, BHK, COS-7, CV, 
and MDCK cell lines. Human 293 cells are preferred. 

EXAMPLE 2: COFACTOR CF1 , CF2, CF3, CF4 AND CF44 OR TISSUE 
LOCALIZATION: 

A multiple tissue northern blot (Clontech, Palo Alto) is hybridized to a labeled probe. 
The blot contains about 0.3 to 3 pg of poly A RNA derived from various tissues. 
Hybridization may be carried out in a hybridization solution such as one containing 
SSC (see Maniatis et al, ibid) at an optimized temperature between 50°c and 70°C, 
preferably 65°C. The filter may be washed and a film exposed for signal detection 
(see also: Maniatis et al., Molecular Cloning: A laboratory Manual, Cold Spring Harbor 
Laboratory Press, N.Y.(1 989)). 

EXAMPLE 3: COFACTOR cDNA ISOLATION FROM HUMAN AND OTHER 
ORGANISMS: 

A cloning strategy is used to clone the CF1, CF2, CF3, CF4 or CF44 cofactor cDNA 
from specific cDNA libraries (Clontech, Palo Alto) or alternatively, RNA is obtained 
from various tissues and used to prepare cDNA expression libraries by using for 
example an Invitrogen kit. (Invitrogen Corporation, San Diego). For the isolation of 
the CF cDNA clones the chosen library may be screened under stringent condition 
(see definitions above) by using CF1, CF2, CF3, CF4 or CF44 specific probes. The 
cDNA insert of positive clones is subsequently sequenced and cloned in a suitable 
expression vector. 
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Additionally, full length cofactor clones from various species are obtained by using 
RACE PCR technology. In brief, suitable cDNA libraries are constructed or 
purchased. Following reverse transcription, the first strand cDNA is used directly in 
RACE PCR reactions using a RACE cDNA amplification kit according to the 
manufactures protocol (Clontech, Palo Alto). Amplified fragments are purified, 
cloned and subsequently used for sequence analysis. 

To obtain information about the genomic organization of the cofactor gene; genomic 
libraries (Clontech, Palo Alto) are screened with a receptor ^specific probe under 
stringent conditions. Positive clones are isolated and the complete DNA sequence of 
the putative receptor is determined by sequence analysis (Maniatis et al„ Molecular 
Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.(1989)). 

EXAMPLE 4: ISOLATION OF THE COFACTOR PROTEINS BY USE OF THE 
YEAST TWO-HYBRID SYSTEM 

A yeast two-hybrid assay was performed . using methods such as described by Fields 
and Song Nature 340, pp245 (1989), Bartel et al., Biotechniques 14, pp920 (1993) 
and Lee et al. Nature 374 pp91-4 (1995). A sequence encoding amino acids 106-434 
of PXR (containing the ligand binding domain; LBD) was cloned into the vector 
pGBT9 (Clontech) In such way that, after transformation of the haploid yeast strain 
CG1945 (Clontech), a hybrid protein is expressed consisting of the DNA-binding 
domain (BD) of the Gal4 transcription factor fused N-terminally to amino acids 106- 
434 of PXR. CG1945 cells expressing the Gal4BD::PXR fusion protein were mated to 
cells of strain Y187 (Clontech) containing a library of Gal4 transcription activation 
domain (AD) fusion plasmids with human cDNA generated from a range of tissues 
inserted into the vector pACT2 (Clontech). All libraries were purchased from Clontech 
Laboratories (MATCHMAKER human cDNA libraries) and included Cat. numbers 
HL4040AH (aorta), HL4041AH (chondrocytes), HY4004AH (brain), HY4035AH 
(testis), HY4024AH (liver), HY4042AH (heart), HY4053AH (bone marrow) and 
HY4028AH (fetal brain). The two-hybrid screens were essentially performed following 
the Clontech "Pretransformed Matchmaker Libraries User Manual" (PT3183-1): 
Transformed CG1945 and Y187 cells were mated in order to coexpress the 
Gal4::PXR fusion protein and the Gal4AD fusion proteins encoded on the library 
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plasmids within one cell. Interaction of the two hybrid proteins led to activation of 
reporter gene transcription. Cells were selected for interactions of PXR with library 
proteins on medium lacking tryptophan, leucine and histidlne and were further 
assayed, for expression of a-galactosidase, encoded by the MEL1 reporter gene. 
Colonies which were positive for reporter gene activation were chosen for further 
analysis. The DNA inserts of the library plasmids contained in these colonies were 
amplified by use of the polymerase chain reaction directly-on the yeastcoloriies using 
oligonucleotide primer which hybridize on vector sequences flanking both sides of the 
insert. The identity of the insert was determined by standard s DNA sequencing 
techniques. 

Five novel cofactors Interacting with PXR were isolated using this approach: CF1 
was isolated from the heart cDNA library, CF2 from the aorta ..testis, fetal brain and 
brain cDNA libraries, CF3 from the fetal brain and heart cDNA libraries, CF4 from the 
heart cDNA library and CF44 from the chondrocyte library. 

EXAMPLES: DETECTION OF MUTANT ALLELES OF THE GENE(S) ACCORDING 
TO THE INVENTION AND THEIR UTILISATION FOR DIAGNOSTIC PURPOSES. 

According to the diagnostic and prognostic method of the present invention, 
alteration of the wild-type cofactor gene Is detected. In addition, the method can be 
performed by detecting the wild-type cofactor gene and confirming the lack of cause 
of the disease as a result of the locus. 

"Alteration of the wild-type gene" encompasses all forms of mutations including 
deletions, insertions and point mutations in the coding and non-coding regions. 
Deletions may be of the entire gene or of only a portion. Point mutations may result in 
stop codons, frameshift mutations or amino acid substitutions. Somatic mutations are 
those which occur only in certain tissues and are not inherited in the germline. 
Germline mutations can be found in any of a body's tissue and are mostly inherited. 
Point mutational events may occur in regulatory regions, such as the promotor of the 
gene, leading to loss or dimunition of expression of the mRNA. Point mutations may 
also abolish proper RNA processing, leading to loss of expression of the cofactor 
gene product or to a decrease in mRNA stability or translation efficiency. 
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Applicable diagnostic techniques include, but are not limited to fluorescent in 
situ hybridization (FISH), direct DNA sequencing, PFGE analysis, Southern blot 
analysis, single stranded conformation analysis (SSCA), RNAse protection assay, 
allele-specific oligonucleotide (ASO), dot blot analysis, , hybridization using nucleic 
acid modified with gold nanoparticles and PCR-SSCP, as discussed in detail further 
below/Furthermore, DNA microchip technology can be applied. 

The presence of a disease due to a germiine mutation : of arcofactor xan be 
ascertained by testing any tissue of the diseased human for mutations of the cofactor 
gene. For instance, a person who has inherited a germiine mutation in the cofactor 
gene, especially one that will alter the interaction of the cofactor with the PXR 
protein, will be prone to develop a disease. The presence of such a mutation can be 
determined by extracting DNA from any tissue of the body. For example, blood can 
be drawn and DNA extracted from blood cells and analyzed. Moreover, prenatal 
diagnosis of the disease will be possible by testing feta! cells, placental cells or 
amniotic cells for mutations in the cofactor gene. There are several methods that 
allow the detection of alterations of the wild-typ cofactor gene, including for instance 
point mutations as well as deletions in the DNA sequence and these method? are 
discussed here: 

Direct genomic DNA Sequencing, either manual or by automated means can detect 
sequence variations of cofactor genes (Nucleic Acids Res 1997 May 15;25(10):2032- 
2034 Direct DNA sequence determination from total genomic DNA. Kilger C, Paabo S, 
Biol. Chem. 1997 Feb; 378(2):99-105, Direct exponential amplification and sequencing 
(DEXAS) of genomic DNA. Kilger C, Paabo S, DE 19653439.9 and DE 19653494,1). 
Another way is to make use of the single-stranded conformation polymorphism 
assay (SSCP; Orita et al., PNAS 86, 2766 (1989)). Variations in the DNA sequence of 
the cofactor gene from the wild-type sequence will be detected due to a shifted 
mobility of the corresponding DNA-fragments in SSCP gels. 

Other approaches are based on the detection of mismatches between the two 
complementary DNA strands. These methods, which will not allow the detection of 
large deletions, duplications or insertions nor the detection of a regulatory mutation 
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affecting transcription or translation of the cofactor gene include the clamped 
denaturing gel electrophoresis (GDGE; Sheffield et ai.,1991), heteroduplex analysis 
(HA; White et al., Genomics 4, 560 (1992)) and chemical mismatch cleavage (CMC; 
Grompe et al., 1989). Other methods detect specific types of mutations such as 
deletions, duplications or insertions, for instance a protein truncation assay or the 
asymmetric assay. These assay however, will not detect , missense mutations. A 
review of currently available methods of detecting DNA sequence variation can be 
found in a review by Grompe, Nature Genetics 5, 111 (1993). Once a mutation is 
known, an allele specific detection approach such as allele, specific oligonucleotide 
(ASO) hybridisation will allow the rapid screening of a large number of other sample 
for that mutation. Such a technique may involve the utilisation of probes which are 
labeled with gold nanoparticles to to yield a visual colour result (Elghanian et 
aL.Science 277, 1078(1997)). 

In another embodiment of the present invention large scale genetic studies might be 
applied to Investigate the association of a disease-phenotype with the gene of 
; interest. The availability of the human genome allows an easy definition of genetic 
\ markers for most genes for a particular disease physiology. More importantly, single 
nucleotide polymorphisms (SNPs) are amenable markers for large genetic studies. 
SNPs in coding or regulatory regions of genes which are thought to contribute to a 
disease physiology can have a direct impact on the phenotype, e.g. change a 
quantitative readout of disease physiology, for example the age of onset of heart 
attack. Association and linkage studies with related individuals, therefore provide an 
excellent means to test or verify a hypothesis on the functional impact of the gene of 
interest on disease physiology in vivo, in humans. 

The PXR protein is known to be involved in controlling the expression of the 
cyclooxgenase P450 or Cyp3A gene. The product of the Cyp3A gene product is a 
hydroxylase which is involved in the metabolism of xenobiotics, including the majority 
of drugs in use. Therefore, alterations identified in the PXR gene can be used to 
predict the ability of a given human individual to metabolize xenobiotics. Proteins 
interacting with PXR, such as the cofactors according to the invention will also be 
involved in the function of PXR. Therefore, alterations in the cofactors are useful for 
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determining the genetic state of a person with respect to its abilitiy to metabolise 
xenobiotics, including drugs. 

One embodiment of the invention is the use of genetic testing of individuals for 
alterations in the genes of the cofactor which interact with PXR to try and predict the 
individuals' capabilities of metabolising xenobiotic substances. This can be done 
using any of the methods described to determine which alleles are present in the 
genome of a given person. In particular, one embodiment is the use of the 
information on polymorphisms in the genes coding for the cofactors to try and predict 
the metabolism of certain drugs. Such drugs include substances which are 
metabolized via the product of the Cyp3a4, or related enzymes. In one preferred 
embodiment, the information on the genetic variations in the cofactors genes is used 
to predict the effect of hyperforin (St Johns worth) on a individual. In another 
preferred embodiment, the information on the genetic variations in the cofactor genes 
is used to predict the effect of steroids and steroid derivatives, such as 17beta 
estradiol and dexamethasone or pregnenolone. The information of the genetic state 
of an individual is also useful to predict drug-drug interactions. 

In order to detect polymorphisms in DNA sequences, DNA samples can be prepared 
from normal individuals and from persons being affected by the disease and these 
samples can be cut by one or more restriction enzymes and applied to Southern 
analysis. Southern blots displaying hybridizing fragments differing in length from the 
control DNA when probed with sequences near or including the cofactor locus could 
indicate a possible mutation. If large DNA fragments are used it is appropriate to 
separate these fragments by pulsed field gel electrophoresis (PFGE). 

Detection of point mutations may be accomplished by amplification, for instance by 
PCR, from genomic or cDNA and sequencing of the amplified nucleic or by molecular 
cloning of the cofactor allele . and sequencing the allele using techniques well known 
In the art. 

There are six well known methods for a more complete, yet still indirect, test for 
confirming the presence of a susceptibility allele: 1) single stranded conformation 
analysis (SSCP) (Orita et al., PNAS t 86, 2766 (1989)); 2) denaturing gradient gel 
electrophoresis (DGGE) (Wartell et al, NAR 18, 2699, (1990); Sheffield et al., PNAS 
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86 , 232 (1989)); 3) RNase protection assays (Finkeistein et al., Genomics 7, 167 
(1990); Kinszler et al., Science 251, 1366 (1991)); 4) allele specific oligonucleotides 
(ASOs, Conner et al., PNAS, 80, 278 (1983)); 5) the use of proteins which recognise 
nucleotide mismatches, such as the E. coli mutS protein (Modrich, Ann. Rev. 
Genetics, 25, 229 (1991)) and 6) allele-specific PCR (Ruano and Kidd, NAR 17, 8392 
(1989)). For allele-specific RCR, primers are used which hybridise at their 3* ends to 
a -particular cofacto mutation. Without the mutation, no PCR product is observed. 
Amplification Refractory Mutation System could also be ' used, vas disclosed in 
European Patent Application Publication No. 0332435 and in Nevvrton-etaL, NAR M, 
2503 (1989). Insertions and deletions of genes can also be detected by molecular 
cloning, amplification and sequencing. Moreover, restriction fragment length 
polymorphism (RFLP) probes for the gene or surrounding marker genes canbe used 
to score for alteration of an allele or an insertion in a polymorphic fragment. Such a 
method would be particularly useful for screening relatives of an affected person for 
the presence of the mutation found in that person. Other approaches for detecting 
insertions and deletions as known for those trained in the art can be used. 

SSCP detects a band which migrates differently because the variation causes a 
difference in single strand, intra molecular base pairing. The RNAse protection assay 
involves cleavage of the mutant fragment into two or more smaller fragments. By 
using DGGE variations in the DNA can be detected by differences in the migration 
rates of mutant compared to normal alleles in a denaturing gradient gel. In the mutS 
assay, the protein binds only to sequences that contain a nucleotide mismatch in a 
hetero duplex between mutant and wild-type sequences. 

Mismatches, according to the present invention, are hybridised nucleic acid duplexes 
in which the two strands are not 100% complementary. Lack of total homology may 
be due to deletions, insertions, inversions or substitutions. Mismatch detection can 
be used to detect point mutations in the gene or the corresponding mRNA product. 
While these techniques are less sensitive than sequencing, they can preferably be 
used when a large number of samples shall be tested. An example of the a mismatch 
cleavage method is the RNAse protection assay. In the practice of the present 
invention , the method Involves the use of a labeled ribonucleotide probe which is 
complementary to the wild-type sequence of the cofactor gene coding sequence.The 
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riboprobe and either mRNA or DNA isolated from the person are hybridised together 
and subsequently digested with the enzyme RNase A which is able to detect some 
mismatches in a duplex RNA structure. If a mismatch is detected by the enzyme, it 
cleaves at the site of the mismatch. Consequently, when the annealed RNA 
preparation is separated on an electrophoretic gel matrix, if a mismatch has been 
detected and cleaved by RNAse A, an RNA product will be seen which is smaller 
than the full length duplex RNA for the riboprobe and the mRNA. or DNA. If, the 
riboprobe comprises only a fragment of the mRNA or the gene, it is: advantageous to 
use a number of probes to screen the whole mRNA sequence for mismatches. 

Similarly, DNA probes can be used to detect mismatch mutations through enzymatic 
or chemical cleavage (Cotton et al., PNAS 85, 4397 (1988); Shenk et al., PNAS 72, 
989 (1975); Novack et al., PNAS 83, 586 (1986)). Alternatively, mismatches can be 
detected by shifts in the electrophoretic mobility of mismatched' duplexes relative to 
match duplexes (Cariello, Human Genetics 42, 726 (1988)). With either riboprobes or 
DNA probes, the cellular mRNA or DNA which might contain a mutation can be 
amplified using PCR (see below) before hybridisation. Variations in DNA of the 
cofactor) gene can also be detected using Southern hybridisation, especially if the 
changes are major rearrangements, such as deletions or insertions. DNA sequences 
of the cofactor gene which have been amplified by PCR may also be screened using 
allele specific probes. These probes are nucleic acid oligomers, each of which 
contains a region of the gene sequence harboring a known mutation. For instance, 
one oligomer coiild be about 25 nucleotides in length corresponding to a portion of 
the gene sequence. By using a number of such allele-specific probes, PCR 
amplification products can be screened to identify the presence of a previously 
discovered mutation in the gene. Hybridisation of allele-specific probes with amplified 
cofactor sequences can be performed, for example, on a nylon filter. Under high 
stringency hybridisation conditions, the hybridisation of a particular probe should 
indicate the presence of the same mutation in the tissue as in the allele-specific 
probe. 

The newly developed technique of nucleic acid analysis via microchip technology is 
also applicable to the present invention^ In this technique, thousands of distinct 
nucleotide probes are built up in an array on a silicon chip. Nucleic acid to be 
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analysed is fluorescently labeled and hybridised to the probes on the chip. It is also 
possible to study nucleic acid-protein interactions using these nucleic acid 
microchips. Using this technique one can determine the presence of mutations or 
even sequence the nucleic acid being analysed or one can measure expression of a 
gene of interest. This method Is one of parallel processing of thousands of probes at 
once and can tremendously accelerate the analysis. In several publications the use 
of this method is described (Hacia et al., Nature Genetics 14, 441 (1996); Shoemaker 
et al., Nature Genetics 14, 450 (1996); Chee et al. a Science 274, 610 (1996); DeRisi 
et al., A/aft/re Genetics 14, 457 (1996)). This new technology has also been reviewed 
in Borman et al., Chemical and Engineering News 9, 42 (1996) and has been subject 
of an editorial in Nature Genetics (1 996). 

The most definite test for mutations in a candidate locus is to directly compare 
genomic cofactor sequences from patients with those from normal individuals. 
Alternatively one could sequence mRNA after amplification (for example by PCR) 
thereby eliminating the necessity of determining the exon structure of the respective 
gene. 

Mutations from patients falling outside the coding region of the cofactor gene can be 
detected by examining the noncoding regions, such as introns and regulatory 
sequences within or near the genes. Early indications of mutations in noncoding 
regions could be for example the abundance or abnormal size of mRNA products in 
patients as compared to control individuals as detected by northern blot analysis. 

Alteration of cofactor expression can be detected by any techniques known in the art. 
These include northern blot analysis, PCR amplification and RNAse protection. 
Diminished mRNA expression indicates an alteration in the wild-type gene sequence. 
Alterations of wild-type genes can also be detected by screening for alteration of 
cofactor protein. For example, monoclonal antibodies against cofactor protein can be 
used to screen a tissue. Lack of cognate antigen would . indicate a mutation. 
Antibodies specific for products of mutant alleles could also be used to detect mutant 
gene product. These kind of immunological assays could be done in any convenient 
format known in the art. These include western blots, immunohistochemical assays 
and ELISA assays. Any means for detecting an altered cofactor protein can be used 
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to detect alteration of the wild-type cofactor gene. Functional assays such as protein 
binding determinations can be used. Moreover, assays can be used which detect the 
cofactor's biochemical function. Finding a mutant cofactor gene product indicates an 
alteration of the cofactor wild-type gene. One such binding assay would test the 
binding of cofactor protein with wild-type PXR protein. Conversely, wild-type PXR 
protein or the domain interacting with the cofactor protein can be used in a protein 
binding assay or biochemical function assay to detect normal or mutant PXR 
proteins. 

A mutant cofactor gene or gene product or a mutant PXR protein can also be 
detected in other human body samples, such as serum, stool, urine and sputum. The 
same techniques discussed above for detection of mutant genes or gene products in 
tissues can be applied to other body samples. By screening such body. samples, a 
simple early diagnosis can be achieved for the disease) resulting from a mutation in 
the cofactor gene. 

EXAMPLE 6: A CELL BASED ASSAY FOR MEASURING THE BINDING OF THE 
COFACTOR TO PXR. 

The DNA sequence encoding the open reading frame of the cofactor is transferred 
into the vector pVP16 (Clontech) to allow the expression of a fusion protein of the 
cofactor with the strong transactivation domain of the VP16 protein (of herpex 
simplex virus) in mammalian cells under the control of the strong CMV promoter. On 
another vector (the reporter), the luciferase gene is cloned under the control of a 
minimal promoter containing a PXR-responsive DNA element. This vector also 
expresses a second enzyme, e.g. beta-rgalactosidase, under the control of a 
constitutive promoter, to allow normalization for transfection efficiency between 
experiments. A third vector contains the PXR gene under the control of the strong 
CMV promoter. 

CV-1 cells are then transiently transfected with different combinations of the three 
plasmids. Transfection is done by standard methods, e.g. by use of the CalPhos 
Maximizer (Clontech, #8021-1,-2). Interaction of the cofactor protein with PXR will 
lead to a strong transactivation due to the attached VP16 domain of the cofactor 
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fusion protein. Thus, interaction of the cofactor with PXR will result in increased 
luciferase activity. Thus, inclusion of the cofactor VP16 will result in increased 
luciferase activity as compared to transfection of the PXR and the reporter alone. To 
measure this effect, extracts are prepared of the transfected cells 48 to 72 hours after 
transfection, and luciferase activity is determined. To normalize for transfection 
efficiency, beta-galactosidase activity is also determined . 

Addition of substances known or suspected to influence the binding of PXR to the 
cofactor are added to the medium of the transfected cells. These substances are 
added at different time points prior to cell lysis, typically ranging between 18 hours to 
a five minutes before cell lysis. Luciferase activity is taken as a measure of the effect 
of these substances on the binding of the cofactor to PXR. To avoid activation of 
PXR by substances contained in the serum of the medium, charcoal-stripped serum 
has to be used for these experiments. 

In. an alternative setting of the experiment the DNA-binding domain of PXR is 
replaced with the DNA-binding domain of the yeast GAL4 transcription factor. On the 
reporter plasmid, the luciferase Is expressed under the control of GAL4-responsive 
upstream activating sequences. Expression of luciferase again is an indication for 
binding of the cofactor-VP16 fusion to the PXR-GAL4 fusion/This setting is also 
referred to as the mammalian two hybrid system. A description of the experiment is 
found in the manual to the Mammalian MATCHMAKER Two-Hybrid Assay Kit from 
Clontech, # PT3002-1 , catalogue #K1 602-1) 

Substances activating nuclear receptors cause an exchange of the proteins bound to 
the receptors, thus effecting the dissociation of some proteins and promoting the 
binding of other proteins. Thus, in the experiments as described above/one can test 
for PXR-activatirig compounds and PXR-inactivating compounds by monitoring the 
binding of the cofactor to PXR. 

In an alternative setting, stably transfected cell lines are used which contain copies of 
the two different expression constructs for PXR and the cofactor as well as the 
reporter construct stably integrated into the chromosomes of the cells. 
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EXAMPLES: A FRET ASSAY USING COFACTOR PROTEINS 

DNA sequences encoding the open reading frame of the cofactor and the PXR gene 
are each transferred separately into the vector pENTRY (Life Technologies) to allow 
efficient construction of a diverse set of expression. constructs. The open reading 
frame is then recombined into the vector pDESTI 7 for expression in E. coli strain 
BL21 as a fusion protein to a six-histidine tag induced by IPTG, as well as into the 
pDEST15 for expression as a fusion protein with glutathione S-transferase (GST). 
The plasmids pDEST15, pDEST17and pENTRY are purchased from LIFE 
TECHNOLOGIES. Alternatively, the open reading frame is introduced into the vector 
pLV-CBDgw for expression as a fusion protein with the calmodulin binding protein 
using recombinant baculoviruses as specified by the manufacturer (Becton 
Dickinson). pLV-CBDgw is a derivative of the vector pLV1392 (Becton Dickinson) 
which is modified by the insertion of a calmodulin binding protein fragment, followed 
by the sequence required for recombinational cloning via the Gateway system (Life 
Technologies). Protein expression is induced and recombinant protein is purified by 
passage over a Ni-NTA-coIumn, or a glutathione columnar a calmodulin column, 
respectively. 

To measure the interaction of the two proteins, a biotinylated (Biotintag Micro 
biptinylation Kit, Sigma) His-tagged PXR protein and the GST fusion of the cofactor 
are mixed at 0.2-5 pM. Antibody to the GST protein is added which is labelled by the 
europium chelate at a concentration of 1-3 (typical 2.5) nM. Streptavidin which is 
fluorescently labeled by covalent attachment of allophycocyanin is added at a 
concentration of 5-30 pg/ml (typical 10pg/ml). The europium chelate is stimulated by 
a flash of light (320nm) and, the emitted light is measured in a delayed (50-200 ps) 
time window for 300 to 600 ps after the flash at 615 nm (fluorescence of europium 
chelate) and 655nm (fluorescence of APC). Since APC is only excited by the light 
emitted by the europium chelate, a close proximity of the two different fluorophores is 
required for excitation. The strength of the APC signal, as well as the ratio of the 
signals from the two fluorophores (i.e. the ratio of the intensities of light emitted at 
655 and 615nm) serves as a measure for the interaction of the two proteins. 
R action buffers contain 20mM TrisHCI pH 7.9, 60mM KCI, 4mM MgCfe- Reaction 
volume is 25pl. The Wallac VictorV fluorimeter is used for the fluorimetric 
measurements. 



SUBSTITUTE SHEET (RULE 26) 



WO 02/18420 



54 



PCT/EPO 1/09488 



In an alternative setting, the cofactor is used as a biotinylated His-tagged protein, and 
the PXR protein is used as fusion to GST. In yet another setting, the His-tagged 
proteins are replaced by the same proteins fused to the calmodulin binding protein. In 
the latter case, the detection of the interaction is via biotinylated calmodulin, which is 
in turn binding to APC-coupled streptayidin. Calcium has to be included in the buffer 
in the form of 4mM CaCl 2 , to allow complex formation between calmodulin and the 
calmodulin binding protein. 
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FIGURE CAPTIONS: 

Fig. 1 shows sequences from the cofactor CF1 according to the invention, Le cDNA 
sequence, reverse complement of the cDNA sequence and protein sequence. . 
Fig. 2 shows sequences from the cofactor CF2 according to the invention Le cDNA 
sequence, reverse complement of the cDNA sequence and protein sequence- 
Fig. 3 shows sequences from the cofactor CF3 according to.the invention Le cDNA 
sequence, reverse complement of the cDNA sequence and protein sequence.. 
Fig. 4 shows sequences from the cofactor CF4 according to the invention Le cDNA 
sequence, reverse complement of the cDNA sequence and protein sequence.. 
Fig. 5 shows sequences from PXR. 
Fig. 6 shows the Ligand Binding Domain from PXR. 
Fig. 7 shows sequences from RXR alpha. 
Fig. 8 shows sequences from RXR beta. 
Fig. 9 shows sequences from RXR gamma. 

Fig. 10 shows sequences from the cofactor CF44 according to the invention Le cDNA 
sequence, reverse complement of the cDNA sequence and protein sequence.. 
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CLAIMS: 

1 . An isolated nucleic acid molecule coding for a cofactor of the pregnane x nuclear 
receptor which is selected from the group comprising: 

a) the nucleotide sequences set forth in SEQ ID NOs: 1 and 28; 

b) or complements thereof as set forth in SEQ ID NOs: 2 and, 29; 

c) a nucleic acid which hybridizes to a nucleic acid having a nucleotide 
sequence which is the complement of the nucleotide sequence of SEQ ID 
NOs 1 and 28 under conditions of high stringency, and 

d) a nucleic acid which hybridizes to a nucleic acid having a nucleotide 
sequence which is the complement of the nucleotide sequence of SEQ ID 
NOs: 2 and 29 under conditions of high stringency. 

2. The isolated nucleic acid molecule of claim 1 which is genomic DNA. 

3. The isolated nucleic acid molecule of claim 1 which is cDNA. 

4. The isolated nucleic acid molecule of claim 1 which is RNA. 

5. An isolated nucleic acid molecule comprising the nucleic acid molecule of any of 
claims 1 to 4 and a label attached thereto. 

6. A vector comprising the nucleic acid molecule of claim 1 . 

7. The vector of claim 6, which is an expression vector. 

8. A host cell transfected with the vector of claim 6 or 7. 

9. A host cell transfected with the expression vector of claim 7. 
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10. A method of producing a polypeptide comprising the step of culturing the host 
cell of claim 9 in an appropriate culture medium to, thereby, produce the 
polypeptide. 

11. An isolated polypeptide encoded by any portion of the nucleic acid of claim 1 . 

12. An isolated polypeptide selected from the group comprising: 

the amino acid sequences set forth in SEQ ID NOs.; 3 and/or 30. 

1 3. Complex comprising a cofactor polypeptide according to any of SEQ ID NOs. 
3 or 30 or a portion thereof additionally, 

comprising a PXR polypeptide according to any of SEQ ID NOs. 15 or 
1 8 or a portion thereof. 

14. Complex comprising a cofactor polypeptide according to any of SEQ ID NOs. 
3 or 30 or a portion thereof additionally, 

comprising a PXR polypeptide according to any of SEQ ID NOs. 15 or 
18 of a portion thereof additionally, 

comprising a RXR polypeptide according to any of SEQ ID NOs. 21 , 24 
or 27 or a portion thereof. 

15. Complex comprising a cofactor polypeptide according to any of SEQ ID NOs. 
3 or 30 or a portion thereof additionally, 

comprising a RXR polypeptide according to any of SEQ ID NOs. 21 , 24 
or 27 or a portion thereof. 

16. A method for screening for agents which are capable of inhibiting the 
cellular function of the cofactor CF1 and/or CF44, comprising the steps of: 

a) contacting one or more candidate agents with a polypeptide according'to 
claims 11,12 or a complex according to claims 13, 14, or 15, 

b) removing unbound agent(s) 

c) detecting whether the agent(s) interact with the polypeptide of the cofactor. 
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1 7. A method for screening for agents which are capable of inhibiting or activating 
the cellular function of PXR, comprising the steps of: 

a) contacting one or more candidate agents which are capable of binding a 
complex according to claims 13, 14, or 15, with a complex according to 
claims, 13, 14 or 15, 

b) removing unbound agent(s), 

c) detecting the amount of the polypeptide according to any of claims 11 or 12 
of the cofactor that has remained bound within the complex and 

d) identifying such agents capable of either: i) releasing a large amount the 
polypeptide according to any of claims 1 1 or 12 of the cofactor from the 
complex, or ii) promoting the association of polypeptides according to any 
of claims 1 1 or 12 of the cofactor to the complex. 

18. Agent identified by the method according to claim 17. 

1 9. A method for inhibiting or activating the cellular function of the cofactor CF1 , 
and/or CF44, comprising the steps of: 

a) contacting a cell with a binding agent that binds the polypeptide according 
to claim 11, 12 or the complex according to claims 13,14 or 15, 

b) whereby the cellular function of CF1 , CF2, CF3 or CF4 is inhibited or 
activated. 

20. A method for inhibiting or activating the binding of the cofactor CF1 and/qr 
CF44, to a PXR polypeptide according to any of SEQ ID NOs. 15 or 18 or a 
portion thereof comprising the steps of: 

a) contacting the polypeptide according to claim 1 1 , 12 or the complex 
according to claims 13,14 or 15 with a binding agent, 

b) whereby the binding of CF1 or CF44 to the PXR polypeptide is inhibited or 
activated. 
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21. Method according to claim 19 or 20, 

characterized in that the binding agent is an antibody. 

22. Method according to claim 1 9 or 20, 

characterized in that the binding agent is RNA. 

23. Method according to claim 19 or 20, 

characterized in that the binding agent is an anti-sense oligonucleotide. 

24. Method according to claim 19 or 20, 

characterized in that the binding agent is a ribozyme. 

25. Method according to claim 19 or 20, 

characterized in that the binding agent is a steroid molecule. 

■ ■ •/ 

26. Method according to claim 19 or 20, 

characterized in that the cell is in a body. 

27. A method for predicting the ability of a human being to metabolise xenobiotic 
substances or drugs comprising the steps of: 

a) screening the genes coding for CF1 and CF44 for genetic aberrations 
or polymorphisms, 

b) whereby the existence of a genetic aberration or polymorphism will 
predict an altered ability to metabolise xenobiotic substances or drugs. 

28. Use of the proteins or a portion thereof according to SEQ ID NO. 3 
and/or SEQ ID NO. 30 or a complex according to claims 13, 14 or 15 for the 
screening for substances that bind said proteins or portions thereof or 
complexes. 

29. Use according to claim 28 wherein the screening is for agonists or antagonist 
ofPXR. 
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Fig-1 . 
CF1: 

SEQIDNO. 1.: 

CTTGCTCTGGCTGTTTCTGCCCCTGGGTTAACATTCAAGATGGTACATGCTGA' 
AGCCTTTTCTCGTCCTTTGAGTGGGAATGAAGTTGTTGGTTTAATTTTCCGTT 

tgacaatatttggtgcagtgacatactttactatcaaatggatggtagatgca! 
attgatcc^ccagaaagc^aaaagtagaagctcagaaacaggcagaaaaact 
aatgaagcaaattggagtgaaaaatgtgaagctctcaaaatatgaaatgagta' 
ttgctgctcatcttgtagaccctcttaatatgcatgttacttggagtgatata- 
gc^ggtttagatgatgtcattacggatctgaaagacacagtcatcttacctat; 
- caaaaagaaacatttgtttgagaattccaggcttctgcagcctccaaaaggtgl 
ttcttctctatgggcctccaggctggggtaaaacgttgattgccaaggccaca' 
gccaaagaagcaggctgtccatttattaaccttcagccttcgacactgaccga'. 
taagtggtatggagaatctcagaaattggctgctggtgtcttatgccttgcgat.' 
taaagctagaaccatccatcatctttatagatggaaatagactccttttttac,, 
gaaaccgttcaagttctgaccatgaaagctacagcccat . ■ 

SEQIDNO. 2: y 

REVERSE COMPLEMENT 

ATGGGCTGTAGCTTTCATGGTCAGAACTTGAACGGTTTCGTAAAAAAGGAGTC 

TATTTCCATCTATAAAGATGATGGATGGTTGTAGCtTTATGGCAAGGGATAAG 

ACAGCAGCAGCCAATTTCTGAGATTCTCCATACCACTTATCGGTCAGTGTCGA 

AGGCTGAAGGTTAATAAATGGACAGCCTGCTTCTTTGGCTGTGGCCTTGGCAA 

TCAACGTTTTACCCCAGCCTGGAGGCCCATAGAGAAGAACACCTTTTGGAGGC 

TGCAGAAGCCTGGAATTCTCAAACAAATGTTTCTTTTTGATAGGTAAGATGAC 

TGTGTCTTTCAGATCCGTAATGACATCATCTAAACCTGGTATATCACTCCAAG 

TAACATGCATATTAAGAGGGTCTACAAGATGAGCAGCAATACTCATTTCATAT 

TTTGAGAGCTTCACATTTTTCACTCCAATTTGCTTCATTAGTTTTTCTGCCTG 

TTTCTGAGCTTCTACTTTTTGCTTTCTGGTTGGATCAATTGCATCTACCATCC 

ATTTGATAGTAAAGTATGTCACTGGACCAAATATTGTCAAACGGAAZ^TTAAA' 

CCAACAACTTCATTGCGACTCAAAGGACGAGAAAAGGCTTCAG(^TGTACCAT 

CTTGAATGTTAACCGAGGGGCAGAAACAGCCAGAGCAAG 

SEQ ID NO. 3: 
PROTEIN 

J^LAVSAPGLTFKMVHAEAFSRPLSRNEWGLIFRLTI^ 
IDPTRKQKVEAQKQAEKLMKQlGVKNVEOj^ 

AGLDDVITDLKDTVILPIKKKHLFENSRLLQPPKGVLLYGPPGWGKTLIAKAT 
AKEAGCPFI^QPSTLTDKWyGESQKLAAAVLSLAIKLQPSIIFIDGNRLLFY 
ETVQVLTMKATAH 
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Fig. 2 
CF2: 

SEQ ID NO. 4: 

GATTTAGTAAGTCACCATGTGAGGACAAAACTTGATQAACTGAAAAGGCAAGA 
AGTAGGAAGGTTAAGAATGTTAATTAAAGCTAAGTTGGATTCCCTTCAAGATA 
TAGGCATGGACCACGAAGCTCTTCTAAAACAATTTGATCACCTAAACCACCTG 
AATCCTGAC^y^GTTTGAATCGACAGATTTAGATATGCTAATCAAAGCGGCAAC 
AAGTGATCTGGAACACTATGACAAGACTCGTCATGAAGAATTTAAAAAATATG 
AAATGATGAAGGAACATGAAAGGAGAGAATATTTAAAAACATTGAATGAAGAA 
AAGAGAT^AAGAAGAAGAGTCTAAATTTGAAGAAATGAAGAAAAAGCATGAAAA 
TCACCCTAAAGTTAATCACCCAGGAAGCAAAGATCAACTAAAAGAGGTATGGG 
AAGAGACTGATGGATTGGATCCTAATGACTTTGACCCCAAGACATTTTTCAAA 
TTACATGATGTCAATAGTGATGGATTCCTGGATGAACAAGAATTAGAAGCCCT 
ATTTACTAAAGAGTTGGAGAAAGTTTATGACCCTAAAAATGAAAAGGATGATA 
TGGTAGAAATGGAAGAAGAAAGGCTTAAAATGAGGGAACATGTAATGAATGAG 
GTTGATACTAACAAAGAC^GATTGGTGACTCTTGGAGGAGTTTTTGAAAGCCA 
CAGAAAAAAAAAGAATTTTTGGAGCCCAGATAGCTGGGA 

SEQ ID NO. 5: 
REVERSE COMPLEMENT 

TCCCAGCTATCTGGGCTC<^AAAATTCTTTTTTTTTCTGTGGCTTTCAAAAAC 
TGCTCCAAGAGTCACCAATCTGTCTTTGTTAGTATCAACCTCATTCATTACAT 
GTTCCCTCATTTTAAGCCTTTCTTCTTCGATTTCTACCATATCATCCTTTTCA 
TTTTTAGGGTCATAAACTTTCTCCAACTCTTTAGTAAATAGGGCTTCTAATTC 
TTGTTCATCCAGGAATCCATCACTATTGACATCATGTAATTTGAAAAATGTCT 
TGGGGTCAAAGTCATTAGGATCCAATCCATCAGTCTCTTCCCATACCTCTTTT 
AGTTGATCTTTGCTTCCTGGGTGATTAACTTTAGGGTGATTTTCATGCTTTTT 
CTTCATTTCTTCAAATTTAGACTCTTCTTCTTTTCTCTTTTCTTCATTCAATG 
TTTTTAAATATTCTCTCCTTTCATGTTCCTTCATCATTTCATATTTTTTAAAT 
TCTTCATGACGAGTCTTGTCATAGTGTTCCAGATCACTTGTTGCCGCTTTGAT 
TAGCATATCTAAATCTGTGGATTC^AACTTGTCAGGATTCAGGTGGTTTAGGT 
GATCAAATTGTTTTAGAAGAGCTTGGTGGTCCATGCCTATATCTTGAAGGGAA 
TCCAACTTAGCTTTAATTAACATTCTTAACCTTCCTACTTCTTGCCTTTTCAG 
TTCATCAAGTTTTGTCCTCACATGGTGACTTACTAAATC 

SEQ ID NO. 6: 
PROTEIN: 

DLVSHHVRTKLDELKRQEVGRLRMLIKAKLDSLQDIGMDHQALLKQFDHLNHL 

NPDKFESTDLDMLIKAATSDLEHYDKTRHEEFKKYEMMKEHERREYLKTLNEE 

KRKEEESKFEEMKKKHENHPKVNHPGSKDQLKEVWEETDGLDPNDFDPKTFFK 

LHDVNSDGFLDEQELEALFTKELEKVYDPKNEKDDMVEMEEERLKMRE 

VDTNKDRLVTLGGVFESHRKKKNFWSPDSW 
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Fig. 3 
CF3: 

SEQ ID NO. 7: 

TTTGTCAGACACCACGTCCGCACAAAGCTGGATGAGCTCAAGCGACAGGAGGT 

GTCACGGCTGCGGATGCTGCTCAAGGCCAAGATGGACGCCGAGCAGGATCCCA 

ATGTACAGGTGGATCATCTGAATCTCCTGAAACAGTTTGAACACCTGGACCCT 

CAGAACCAGCATACATTCGAGGCCCGCGACCTGGAGCTGCTGATCCAGACGGC 

eACCCGGGACCTTGCCCAGTACGACGCAACCCATCATGAAAAGTTCAAGCGCT 

ACGAGATGCTTAAGGAACAGGAGAGACGGCGTTATCTGGAGTCACTGGGAGAG 

GAGCAGAGAAAGGAGGCGGAGAGGAAGCTGGAAGAGCAACAGCGCCGGCACCG 

CGAGCACCCTAAAGTCAACGTGCCTGGCAGCCAAGCGCAGTTGAAGGAGGTGT 

GGGAGGAGCTGGATGGACTGGACCCCAACAGGTTTAACCCCAAGACCTTCTTC * 

ATACTGCATGATATCAACAGTGATGGTGTCCTGGATGAGCAGGAGCTGGAGGC 

ACTGTTCACCAAGGAGCTGGAGAAAGTGTACGACCCAAAGAATGAGGAGGACG 

ACATGCGGGAGATGGAGGAGGAGCGACTGCGCATGCTGAAGCATGTGATGAAG ; 

AATGTGGACACCCAACCAGGACCG 

SEQ ID NO. 8: 

REVERSE COMPLEMENT: 

CGGTCCTGGTTGGGTGTCCACATTCTT.CATCACATGCTTCAGCATGCGCAGTC 

GCTCCTCCTCCATCTCCCGCATGTCGTCCTCCTCATTCTTTGGGTCGTACACT 

TTCTCCAGCTCCTTGGTGAAGAGTGCCTCCAGCTCCTGCTCATCCAGGACACC ; 

ATCACTGTTGATATCATGCAGTATGAAGAAGGTCTTGGGGTTAAAC CTGTTGG , 

GGTCCAGTCCATCCAGCTCCTCCCACACCTCCTTCAACTGGGCTTGGCTGCCA 

GGCACGTTGACTTTAGGGTGCTCGCGGTGCCGGCGCTGTTGCTCTTCCAGCTT 

CCTCTCCGCCTCCTTtCTCTGCTCCTCTCCCAGTGACTCCAGATAACGCCGTC 

TCTCGTGTTCCTTAAGCATCTCGTAGCGCTTGAACTTTTCATGATGGGTTGCG 

TCGTACTGGGCAAGGTCCCGGGTGGCCGTCTGGATCAGCAGCTCCAGGTCGCG 

GGCCTCGAATGTATGCTGGTTCTGAGGGTCCAGGTGTTCAAACTGTTTCAGGA 

GATTCAGATGATCCACCTGTACATTGGGATCCTGCTCGGCGTCGATCTTGGCC 

TTGAGCAGCATCCGCAGCCGTGACACCTCCTGTCGCTTGAGCTCATCCAGCTT. 

TGTGCGGACGTGGTGTCTGACAAA 

SEQ ID NO. 9: 
PROTEIN: 
FVRHHVRTKLDELKRQEVSRLRMLLKAK^ 

QNQHTFEARDLELLIQTATRDIiAQYDATHHEKFKRYEMLKEHERRRYLESLGE 
EQRKEAERKLEEQQRRHREHPKVNVPGSQAQLKEVWEELDGLDPNRFNPKTFF 
ILHDINSDGVLDEQELEALFTKELEKVYDPKl^EDDMREMEEERLRMLKHVMK 

NVDTQPGP 
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Fig. 4 
CF4: 



SEQ ID NO. 10: 

GGGGACTCGGCCCTGAACGAGCAGGAGAAGGAGTTGCAGCGGCGGCTGAAGCG 
TCTCTACCCGGCCGAGGACGAACAAGAGACGCCGCTGCCTAGGTCCTGGAGCC 
CGAAGGACAAGTTCAGCTACATCGGCCTCTCTCAGAACAACCTGCGGGTGCAC 
TACAAAGGTCATGGCAAAACCCCAAAAGATGCCGCGTCAGTTCGAGCCACGCA 
TCCAATACCAGCAGCCTGTGGGATTTATTATTTTGAAGTAAAAATTGTCAGTA 

agggaagagatggttacatgggaattggtctttctgctcaaggtgtgaacatg 

AATAGACTACCAGGTTGGGATAAGGATTCATATGGT.TAGCATGGGGATGATGG 
ACATTCGTTTTGTTCTTCTGGAACTGGACAACCTTATGGACCAACTTTCACTA 
CTGGTGATGTCATTGGCTGTTGTGTTAATCTTATCAACAATACCTGCTTTTAC 
ACCAAGAATGGACATAGTTTAGGTATTGCTTTACTGACCTACCGCCAAATTTG 
TATCCTACTGTGGGGCTTCAAAGACCAGGAGAAGTGGTCGATGCCAATTTTTG 
. GGCAACATCGTTTCCGTGTTTGATATAAAAAACTATATGCCGGGAGTGGAGAA 
CCAAAATCCAGGCCCCAGATAGATCCGATTTCCT 

SEQ ID NO. 11: 
REVERSE COMPLEMENT 

AGGAAATCGGATCTATCTGGGGCCTGGAfTTTGGTTCTCCACTCCCGGCATAT 
AGTTTTTT ATATGAAACACGGAAAGGATGTTGCCCAAAAATTGGCATGGACC A 
CTTCTCCTGGTGTTTGAAGCCCCACAGTAGGATACAAATTTGGCGGTAGGTCA 
» GTAAAGCAATACCTAAACTATGTCCATTCTTGGTGTAAAAGGAGGTATTGTTG 
ATAAGATTAACACAACAGGCAATGACATCACCAGTAGTGAAAGTTGGTCCATA 
AGGTTGTGCAGTTCCAGAAGAACAAAACGAATGTCCATCATCCCCATGGTAAC 

catatgaatgcttatcccaacctggtagtctattcatgttcacaccttgagca 

GAAAGACCAATTCCCATGTAACCATCTCTTCCCTTACTGACAATTTTTACTTC 
AAAATAATAAATCCCACAGGCTGCTGGTATTGGATGCGTGGCTCGAACTGACG 
CGGGATCTTTTGGGGTTTTGCCATGACCTTTGTAGTGCACCCGCAGGTTGTTC 
TGAGAGAGGC CGATGTAGCTGAACTTGTGCTTCGGGCTCCAGGACCTAGGCAG 
GGGCGTCTCTTGTTCGTCCTCGGCCGGGTAGAGACGCTTCAGCCGCCGCTGCA 
ACTCCTTCTCCTGCTCGTTCAGGGCCGAGTCCCC 

SEQ ID NO. 12: 
PROTEIN 

GDSALNEQEKELQRRLKRLYPAEDEQETPLPRSWSPKDKFSYIGLSQNNLRVH 
yKGHGKTPKDAASVRATHPlPAACGIYYFEVKIVSKGRDGYMGIGLSAQGVNM 
NRLPGWDKHSYGYHGDDGHSFCSSGTGQPYGPTFTTGDVIGCCVNLINNTCFY 
TKNGHSLGIALLTYRQICILLWGFKHQEKWSMPIFGQHPFRV 
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Fig- 5 „. . 

PXR (ORFV: 

SEQIDNO. 13: 

ATGACATGTGAAGGATGCAAGGGCTTTTTCAGGAGGGCCATGAAACGCAACGC 
CCGGCTGAGGTGCCCCTTCCGGAAGGGCGCCTGCGAGATCACCCGGAAGACCC 
GGCGACAGTGCCAGGCCTGCCGCCTGCGCAAGTGCCTGGAGAGCGGCATGAAG 
AAGGAGATGATCATGTCCGACGAGGCCGTGGAGGAGAGGCGGGCCTTGATCAA 
GCGGAAGAAAAGTGAACGGACAGGGACTCAGCCACTGGGAGTGCAGGGGCTGA 
CAGAGGAGCAGCGGATGATGATCAGGGAGCTGATGGACGCTCAGATGAAAACC 
TTTGACACTACCTTCTCCCATTTCAAGAATTTCCGGCTGCCAGGGGTGCTTAG 
CAGTGGCTGCGAGTTGCCAGAGTCTCTGCAGGCCCCATCGAGGGAAGAAGCTG 
CCAAGTGGAGCCAGGTCCGGAAAGATCTGTGCTCTTTGAAGGTCTCTCTGCAG 
CTGCGGGGGGAGGATGGCAGTGTCTGGAACTACAAACCCCCAGCCGACAGTGG 
CGGGAAAGAGATCTTCTCCCTGCTGCCCCACATGGCTGACATGTCAACCTACA 
TGTTCAAAGGCATCATCAGCTTTGCCAAAGTCATCTCCTACTTCAGGGACTTG 
CCCATCGAGGACCAGATCTCCCTGCTGAAGGGGGCCGCTTTGGAGCTGTGTGA 
ACTGAGATTCAACACAGTGTTCAACGCGGAGACTGGAACCTGGGAGTGTGGCC 
GGCTGTCCTACTGCTTGGAAGACACTGCAGGTGGCTTCGAGCAACTTCTACTG 
GAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTGCATGAGGA 
GGAGTATGTGCTGATGCAGGCCATCTCCCTCTTCTCCCCAGACCGCCCAGGTG 
TGGTGCAGCACCGCGTGGTGGACCAGCTGCAGGAGCAATTCGCCATTACTCTG 
AAGTCCTACATTGAATGCAATCGGCCCCAGCCTGCTCATAGGTTCTTGTTCCT 
GAAGATCATGGCTATGCTCACCGAGCTGCGCAGCATCAATGCTCAGCACAGCC 
AGCGGCTGCTGCGCATCCAGGACATACACCCCTTTGCTACGCGCCTCATGCAG 
GAGTTGTTCGGCATCACAGGTAGCTGA 

PXR Reverse complement: 
SEQIDNO. 14: 

AGTCGATGGACACTACGGCTTGTTGAGGACGTACTCCCGGCATCGTTTCCCCA 
CATACAGGACCTACGCGTCGTCGGCGACCCACACGACTCGTAACTACGACGCC 
TGGAGCCACTCGTATCGGTACTAGAAGTCCTTGTTCTTGGATACTCGTCCGAC 
CCCGGCTAACGTAAGTTACATCCTGAAGTCTCATTACCGCTTAACGAGGACGT 
CGACCAGGTGGTGCGCCACGACGTCGTGTGGACCCGCCAGACCCCTCTTCTCC 
CTCTACCGGACGTAGTCGTGTATGAGGAGGAGTACGTCGACGTCGAAGAAGTC 
GTACATCACCTTAAAGTCGTACCCGAGGTCATCTTCAACGACCTTCGGTGGAC 
GTCACAGAAGGTTCGTCATCCTGTCGGCCGGTGTGAGGGTCCAAGGTCAGAGG 
CGCAACTTGTGAC^CAACTTAGAGTCAACTGTGTCGAGCTTTCGCCGGGGGAA 
GTCGTCCCTCTAGACCAGGAGCTACCCGTTCAGGGACTTCATCCTCTACTGAA 
ACCGTTTGGACTACTACGGAAACTTGTAGATCCAACTGTACAGTCGGTACACC 
CCGTCGTCCCTCTTCTAGAGAAAGGGCGGTGACAGCCGACCCCCAAACATCAA 
GGTCTGTGACGGTAGGAGGGGGGCGTCGACGTCTCTCTGGAAGTTTCTCGTGT 
CTAGAAAGGCCTGGACCGAGGTGAACCGTGGAAGAAGGGAGCTACCCCGGACG 
TCTCTGAGACCGTTGAGCGTCGGTGACGATTCGTGGGGACCGTCGGCCTTTAA 
GAACTTTACCCTCTTCCATCACAGTTTCCAAAAGTAGACTCGCAGGTAGTCGA 
GGGACTAGTAGTAGGCGACGAGGAGACAGTCGGGGACGTGAGGGTCACCGACT 
CAGGGACAGGCAAGTGAAAAGAAGGCGAACTAGTTCGGGGCGGAGAGGAGGTG 
CCGGAGCAGCCTGTACTAGTAGAGGAAGAAGTACGGCGAGAGGTCCGTGAACG 
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CGTCCGCCGTCCGGACCGTGACAGCGGCCCAGAAGGCCCACTAGAGCGTCCGC 
GGGAAGGCCTTCCCCGTGGAGTCGGCCCGCAACGCAAAGTACCGGGAGGACTT 
TTTCGGGAACGTAGGAAGTGTACAGTA 

PXR-Protein 
SEQIDNO. 15: . 

MEVRPKESWNHADFVHCEDTESVPGKPSVNADEEVGGPQICRVCGDKATGYHF 
NVMTCEGCKGFFRRAMKRNARLRCPFRKGACEITRKTRRQCQACRLRKCLESG 
fQCKEMIMSDEAVEERRALIKRKKSERTGTQPLGVQGIiTEEQRMMIRELMDAQM 
KTFDTTFSHFKNFRLPGVLSSGCELPESLQAPSREEAAKWSQVRKDLCSLKVS 
LQLRGEDGSVWNYKPPADSGGKEIFSLLPHMADMSTYMFKGIISFAKVISYFR 
DLPIEDQISLLKGAAFELCQLRFNTVFNAETGTWECGRLSYCLEDTAGGFQQL 
LLEPMLKFHYMLKKLQLHEEEYVLMQAISLFSPDRPGVLQHRVVDQLQEQFAI 
TLKSYIECNRPQPAHRFLFLKIMAMLTELRSINAQHTQRLLRIQDIHPFATPL 
MQELFGITGS 
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PXR (Liqand Binding Domain LBD) 

SEQ ID NO. 16: 



PGT/EPO 1/09488 



GGCATGAAGAAGGAGATGATCATGTCCGACGAGGCCGTGGAGGAGAGGCGGGC 
CTTGATCAAGCGGAAGAAAAGTGAACGGACAGGGACTCAGCCACTGGGAGTGe 
AGGGGCTGACAGAGGAGCAGCGGATGATGATCAGGGAGCTGATGGACGCTCAG: 
ATGAAAACCTTTGACACTACCTTCTCCCATTTCAAGAATTTCCGGCTGCCAGG 
GGTGCTTAGCAGTGGCTGCGAGTTGCCAGAGTCTCTGCAGGCGCCATCGAGGG 
AAGAAGCTGCCAAGTGGAGCCAGGTCCGGAAAGATCTGTGCTCTTTGAAGGTC 
TCTCTGCAGCTGCGGGGGGAGGATGGCAGTGTCTGGAACTACAAACCCCGAGC 
CGACAGTGGCGGGAAAGAGATCTTCTCCCTGCTGCCCCACATGGCTGACATGT 
CAACCTACATGTTCAAAGGCATCATCAGCTTTGCCAAAGTCATCTCCTACTTC 
AGGGACTTGCCCATCGAGGACCAGATCTCCCTGCTGAAGGGGGCCGCTTTCGA . : 
GCTGTGTCAACTGAGATTCAACACAGTGTTCAACGCGGAGACTGGAACCTGGG ' 
AGTGTGGCCGGCTGTCCTACTGCTTGGAAGACACTGCAGGTGGCTTCCAGCAA 
CTTCTACTGGAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTi' 
-GCATGAGGAGGAGTATGTGCTGATGCAGGCCATCTCCCTCTTCTCCCCAGACC , 
GCCCAGGTGTGCTGCAGCACCGCGTGGTGGACCAGCTGCAGGAGCAATTCGCC 
ATTACTCTGAAGTCCTACAT.TGAATGCAATCGGCCCCAGCCTGCTCATAGGTT : i 
CTTGTTCCTGAAGATCATGGCTATGCTCACCGAGCTCCGCAGCATCAATGCTC 
AGCACACCCAGCGGCTGCTGCGCATCCAGGACATACACCCCTTTGCTACGCCC " 
CTCATGCAGGAGTTGTTCGGCATCACAGGTAGCTGA : v 

PXR-LBD reverse complement 
SEQ ID NO. 17: 

tcagctacctgtgatgccgaacaactcctgcatgaggggcgtagcaaaggggt 

gtatgtcctggatgcgcagcagccgctgggtgtgctgagcattgatgctgcgg 

agctcggtgagcatagccatgatcttcaggaacaagaacctatgagcaggctg 

gggccgattgcattcaatgtaggacttcagagtaatggcgaattgctcctgca ' 

gctggtccaccacgcggtgctgcagcacacctgggcggtctggggagaagagg - 

gagatggcctgcatcagcacatactcctcctcatggagctgcagcttcttcag ? 

catgtagtggaatttcagcatgggctccagtagaagttgctggaagccacctg / 

cagtgtcttccaagcagtaggacagccggccacactcccaggttccagtctcc 

gcgttgaacactgtgttgaatctcagttgacacagctcgaaagcggccccctt; 

cagcagggagatctggtcctcgatgggcaagtccctgaagtaggagatgactt> 

tggcaaagctgatgatgcctttgaacatgtaggttgacatgtcagccatgtgg 

ggcagcagggagaagatctctttcccgccactgtcggctgggggtttgtagtt 

ccagacactgccatcctccccccgcagctgcagagagaccttcaaagagcaca' 

gatctttccggacctggctccacttggcagcttcttccctcgatggggcctgc 

agagactctggcaactcgcagccactgctaagcacccctggcagccggaaatt 

cttgaaatgggagaaggtagtgtcaaaggttttcatctgagcgtccatcagct 

ccctgatcatcatccgctgctcctctgtcagcccctgcactcccagtggctga 

gtccctgtccgttcacttttcttccgcttgatcaaggcccgcctctcctccac 

ggcctcgtcggacatgatcatctccttcttcatgcc 
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PXR-LBD Protein: 
SEQ ID NO. 18: 

GMKKEMIMSDEAVEERRALIKRKKSERTGTQPLGVQGLTEEQRMMIRELMDAQ 
MKTFDTTFSHFKNFRLPGVLSSGCELI?ESLQAPSREEAAKWSQVRKDLCSLKV 

. SLQLRGEDGS VWNYKPPADSGGKEIFSLLPHMADMSTYMFKGI I S FAKVIS YF 
RDLPIEDQISLLKGAAFELCQLRFNTVFNAETGTWECGRLSYCLEDTAGGFQQ 
LLLEPMLKFHYMLKKLQLHEEEYVLMQAISLFSPDRPGVLQHRVVDQLQEQFA 

. ITLKSYIECNRPQPAHRFLFLKIMAMLTELRS INAQHTQRLLRIQDIHPFATP 
LMQELFGITGS ' 
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RXRalpha (ORB: 

SEQ ID NO. 19: 

ATGGACACGAAACATTTCCTGCCGCTCGATTTCTCCACCCAGGTGAACTCCTC 
CCTCACCTCCCCGACGGGGCGAGGCTCCATGGCTGCCCCCTCGCTGCACCCGT 
CCCTGGGGCCTGGCATCGGCTCCCCGGGACAGCTGCATTCTCCCATCAGCACC 
CTGAGCTCCCCC^TC?^ACGGCATGGGCCCGCCTTTCTCGGTCATCAGCTCCCC 
C^TGGGCCCCCACTC<^TGTCGGTGCCC^CCACACCCACCCTGGGCTTCAGCA 
CTGGCAGCCCCCAGCTCAGCTCACCTATGAACCCCGTCAGCAGCAGCGAGGAC 
ATCAAGCCCCCCCTGGGCCTGAATGGCGTCCTCAAGGTCCCCGCCCACCCCTC 
.AGGAAACATGGCTTCCTTGACCAAGGAO^TCTGCGCCATCTGCGGGGACCGCT 
CCTCAGGCAAGCACTATGGAGTGTACAGCTGCGAGGGGTGCAAGGGCTTCTTC 
AAGCGGACGGTGCGCAAGGACCTGACCTACACCTGCCGCGACAACAAGGACTG 
CCTGATTGACAAGCGGCAGCGGAACCGGTGCCAGTACTGCCGCTACCAGAAGT 
GCCTGGCCATGGGCATGAAGCGGGAAGCCGTGCAGGAGGAGCGGCAGCGTGGC 
AAGGACCGGAACGAGAATGAGGTGGAGTCGACCAGCAGCGCCAACGAGGACAT 
-GCCGGTGGAGAGGATCCTGGAGGCTGAGCTGGCCGTGGAGCCCAAGACCGAGA 
CCTACGTGGAGGCAAACATGGGGCTGAACCCCAGCTCGCCGAACGACCCTGTC . 
ACGAACATTTGCCAAGCAGCCGACAAACAGCTTT.TCACCCTGGTGGAGTGGGC 
CAAGCGGATCCCACACTTCTCAGAGCTGCCCCTGGACGACCAGGTCATCCTGC 
TGCGGGCAGGCTGGAATGAGCTGCTCATCGCCTCCTTCTCCCACCGCTCCATC 
GCCGTGAAGGACGGGATCCTCCTGGCCACCGGGCTGCACGTCCACCGGAACAG 
CGCCCACAGCGCAGGGGTGGGCGCCATCTTTGACAGGGTGCTGACGGAGCTTG 
TGTCCAAGATGCGGGACATGCAGATGGACAAGACGGAGCTGGGCTGCCTGCGC 
GCCATCGTCCTCTTTAACCCTGACTCCAAGGGGCTCTCGAACCCGGCCGAGGT 
GGAGGCGCTGAGGGAGAAGGTCTATGCGTCCTTGGAGGCCTACTGCAAGCACA 
AGTACCCAGAGCAGCCGGGAAGGTTCGCTAAGCTCTTGCTCCGCCTGCCGGCT 
CTGCGCTCCATCGGGCTCAAATGCCTGGAACATCTCTTCTTCTTCAAGCTCAT 
CGGGGACACACCCATTGACACCTTCCTTATGGAGATGCTGGAGGCGCCGCACC 
AAATGACTTAG 

RXRalpha reverse complement: 
SEQ ID NO. 20: 

CTAAGTCATTTGGTGCGGCGCCTCGAGCATCTCCATAAGGAAGGTGTCAATGG 
GTGTGTCCCCGATGAGCTTGAAGAAGAAGAGATGTTCCAGGCATTTGAGCCCG 
ATGGAGCGCAGAGCCGGCAGGCGGAGCAAGAGCTTAGCGAACCTTCCCGGCTG 
CTCTGGGTACTTGTGCTTGCAGTAGGCCTCGAAGGACGCATAGACCTTCTCCC 
TCAGCGCCTCCACCTCGGCCGGGTTCGAGAGCCCCTTGGAGTCAGGGTTAAAG 
AGGACGATGGCGCGCAGGCAGCCCAGCTCGGTCTTGTCCATCTGCATGTCCCG 
CATCTTGGACACAAGCTCCGTCAGCACCCTGTCAAAGATGGCGCCCACCCCTG 
CGCTGTGGGCGCTGTTCCGGTGGACGTGCAGCCCGGTGGCCAGGAGGATCCCG 
TCCTTCACGGCGATGGAGCGGTGGGAGAAGGAGGCGATGAGCAGCTCATTCCA 
GCCTGCCCGCAGCAGGATGACCTGGTCGTCCAGGGGCAGCTCTGAGAAGTGTG 
GGATCCGCTTGGCCCACTCCACCAGGGTGAAAAGGTGTTTGTCGGCTGCTTGG 
CAAATGTTGGTGACAGGGTCGTTCGGCGAGCTGGGGTTCAGCCCCATGTTTGC 
CTCCACGTAGGTCTCGGTCTTGGGCTCCACGGCCAGCTCAGCCTCCAGGATCC 
TCTCCACCGGCATGTCCTCGTTGGCGCTGCTGGTCGACTCCACCTCATTCTCG 
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TTCCGGTCCTTGCCACGCTGCC.GCTCCTCCTGCACGGCTTCCCGCTTCATGCC 
CATGGCCAGGCACTTCTGGTAGCGGCAGTACTGGCACCGGTTCCGCTGCCGCT 
TGTCAATCAGGCAGTCCTTGTTGTCGCGGCAGGTGTAGGTCAGGTCCTTGCGC 
ACCGTCCGCTTGAAGAAGCCCTTGCACCCCTCGCAGCTGTACACTCCATAGTG 
CTTGCCTGAGGAGCGGTCCCCGCAGATGGCGCAGATGTGCTTGGTGAAGGAAG 
CCATGTTTCCTGAGGGGTGGGCGGGGACCTTGAGGACGCCATTGAGGCCCAGG 
GGGGGCTTGATGTCCTCGCTGCTGCTGACGGGGTTCATAGGTGAGCTGAGCTG 
GGGGCTGCCAGTGCTGAAGCCCAGGGTGGGTGTGGTGGGCACCGACATGGAGT 
GGGGGCCC^TGGGGGAGCTGATGACCGAGAAAGGCGGGCCCATGCCGTTGATG 
GGGGAGCTCAGGGTGCTGATGGGAGAATGCAGCTGTCCCGGGGAGCCGATGCC 
AGGCCCCAGGGACGGGTGCAGCGAGGGGGCAGCCATGGAGCCTCGCCCCGTCG 
GGGAGGTGAGGGAGGAGTTCACCTGGGTGGAGAAATCGAGCGGCAGGAAATGT 
TTGGTGTCCAT 

RXRalpha-Protein. 
SEQID NO. 21: 




. AVKDGILLATGLHVHRNSAHSAGVGAI FDRVLTELVSKMRDMQMDKTELGCIJR 
AIVLFNPDSKGLSNPAEVEALREKVYASLEAYCKHKYPEQPGRPAKLLLRLPA 
LRSIGLKCLEHLFFFKLIGDTPIDTFLMEMLEAPHQMT 
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RXRb ta (ORR: 

SEQ ID NO. 22: 

ATGTCTTGGGCCGCTCGCCCGCCCTTCCTCCCTCAGCGGCATGCCGCAGGGCA 
GTGTGGGGCGGTGGGGGTGCGAAAAGAAATGCATTGTGGGGTCGCGTCCCGGT 
GGCGGCGGCGACGGCCCTGGCTGGATCCCGCAGCGGCGGCGGCGGCGGCGGTG 
GCAGGCGGAGAACAACAAACCCCGGAGCCGGAGCCAGGGGAGGCTGGACGGGA 
CGGGATGGGCGACAGCGGGCGGGACTCCCGAAGCCCAGACAGCTCCTCCCCAA 
ATCCCCTTCCCCAGGGAGTCCCTCCCCCTTCTCGTCCTGGGCCACCCCTACCG 
CCTTCAACAGCTCCTACCCTTGGAGGCTCTGGGGCCCCACCCCCACCCCCGAT 
GCCACCACCCCCACTGGGCTCTCCCTTTCCAGTCATCAGTTCTTCCATGGGGT 
CCCCTGGTCTGCCCCCTCCAGCTCCCCCAGGATTCTCCGGGCCTGTCAGCAGC 
CCCCAGATTAACTCAACAGTGTCACTCCCTGGGGGTGGGTCTGGCCCCCCTGA 
AGATGTGAAGCCACCAGTCTTAGGGGTCCGGGGCCTGCACTGTCCACCCCCTC 
CAGGTGGCCCTGGGGCTGGCAAACGGCTATGTGCAATCTGCGGGGACAGAAGC 
TGAGGCAAACACTACGGGGTTTACAGCTGTGAGGGTTGCAAGGGCTTCTTeAA 
ACGCACCATCCGCAAAGACCTTACATACTCTTGCCGGGACAAGAAAGACTGCA 
CAGTGGACAAGCGCCAGCGGAACCGCTGTCAGTACTGCCGCTATCAGAAGTGC 
CTGGCCACTGGCATGAAGAGGGAGGCGGTACAGGAGGAGCGTCAGCGGGGAAA 
GGACAAGGATGGGGATGGGGAGGGGGCTGGGGGAGCCCCCGAGGAGATGCCTG 
TGGACAGGATCCTGGAGGCAGAGCTTGCTGTGGAA.CAGAAGAGTGACCAGGGC 
GTTGAGGGTCCTGGGGGAACCGGGGGTAGCGGCAGCAGCCCAAATGACCCTGT 
GACTAACATCTGTCAGGCAGCTGACAAACAGCTATTCACGCTTGTTGAGTGGG 
GGAAGAGGATCCCACACTTTTCCTGGTTGCCTGTGGATGATCAGGTCATATTG 
CTGCGGGCAGGCTGGAATGAA.CTCCTCATTGCGTCCTTTTCACACCGATCCAT 
TGATGTTCGAGATGGCATCCTCCTTGCCACAGGTCTTCACGTGCACCGCAACT 
CAGCCCATTCAGCAGGAGTAGGAGCCATCTTTGATCGGGTGCTGACAGAGCTA 
GTGTCCAAAATGCGTGACATGAGGATGGACAAGACAGAGCTTGGCTGCCTGAG 
C3GCAATCATTCTGTTTAATCCAGATGCCAAGGGCCTCTCCAACCCTAGTGAGG 
TGGAGGTCCTGCGGGAGAAAGTGTATGCATCACTGGAGACCTACTGCAAAGAG 
AAGTACCCTGAGCAGCAGGGACGGTTTGCCAAGCTGCTGCTACGTCTTCCTGC 
CCTCCGGTCCATTGGCCTTAAGTGTCTAGAGCATCTGTTTTTCTTCAAGCTCA 
TTGGTGACACCCCCATCGACACCTTGCTCATGGAGATGCTTGAGGCTCCCCAT 
CAACTGGCCTGA 

RXRbeta reverse complement: 
SEQ ID NO. 23: 

TCAGGCCAGTTGATGGGGAGCCTCAAGCATCTCCATGAGGAAGGTGTCGATGG 
GGGTGTCACCAATGAGCTTGAAGAAAAACAGATGCTCTAGACACTTAAGGCCA 
ATGGACCGGAGGGCAGGAA.GACGTAGCAGCAGCTTGGCAAACCGTCCCTGGTG 
CTCAGGGTACTTCTGTTTGCAGTAGGTCTCCAGTGATGCATACACTTTCTCCC 
GCAGGACCTCCACCTCACTAGGGTTGGAGAGGCCCTTGGCATCTGGATTAAA.C 
AGAATGA^GCCCTCAGGCAGCCAAGCTCTGTCTTGTCCATCCTCATGTCACG 
CATTTTGGACACTAGCTCTGTCAGCACCCGATCAAAGATGGCTCCTACTCCTG 
CTGAATGGGCTGAGTTGCGGTGCACGTGAAGACCTGTGGCAAGGAGGATGCCA 
TCTCGAACATCAATGGATCGGTGTGAAAAGGAGGCAATGAGGAGTTCATTCCA 

gcctgcccgcagcaatatgacctgatcatccagaggcaaggaggaaaagtgtg 
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GGATCCTCTTCGCCCACTCAACAAGCGTG2\ATAGCTGTTTGTCAGCTGCCTGA 
CAGATGTTAGTCACAGGGTCATTTGGGCTGCTGCCGCTACCCCCGGTTCCCCC 
AGGACCCTCAACGCCCTGGTCACTCTTCTGTTCCACAGCAAGCTCTGCCTCCA 
GGATCCTGTCCACAGGCATCTCGTCGGGGGCTGCCCCAGCCCGCTCCCCATCC 
CCATCCTTGTCCTTTCCCCGCTGACGCTCCTCCTGTACCGCCTCCCTCTTCAT 
GCCAGTGGCCAGGCACTTCTGATAGCGGCAGTACTGACAGCGGTTCCGCTGGC 
GCTTGTCCACTGTGCAGTCTTTGTTGTGCCGGCAAGAGTATGTAAGGTCTTTG 
CGGATGGTGCGTTTGAAGAAGCCCTTGCAACCCTCACAGCTGTAAACCCCGTA 
GTGTTTGCCTGAGCTTCTGTCCCCGCAGATTGCACATAGCCGTTTGCCAGCCC 
CAGGGCCACCTGGAGGGGGTGGACAGTGCAGGCCCCGGACCCCTAAGACTGGT 
GGCTTCACATCTTCAGGGGGGCCAGACCCACCCCCAGGGAGTGACACTGTTGA 
GTTAATCTGGGGGCTGCTGACAGGCCCGGAGAATCCTGGGGGAGCTGGAGGGG 
GCAGACCAGGGGACCCGATGGAAGAACTGATGACTGGAAAGGGAGAGCCCAGT 
GGGGGTGGTGGCATCGGGGGTGGGGGTGGGGCCCCAGAGCCTCCAAGGGTAGG 
AGCTGTTGAAGGGGGTAGGGGTGGCCCAGGAGGAGAAGGGGGAGGGACTCCCT 
GGGGAAGGGGATTTGGGGAGGAGCTGTCTGGGCTTCGGGAGTCCCGCCCGCTG 
TCGCCCATCCCGTCCCGTCCAGCCTCCCCTGGCTCCGGCTCCGGGGTTTGTTG 
TTCTCCGCCTGCCACCGCCGCCGCCGCCGCCGCTGCGGGATCCAGCCAGGGCC 
GTCGCCGCCGCCACCGGGACGCGACCCCACAATGCATTTCTTTTCGCACCGCC 
ACCGGCCCAGACTGCCCTGCGGCATGCCGCTGAGGGAGGAAGGGCGGGCGAGC 
GGCCCAAGACAT 

RXRbeta-Protein: 
SEQ ID NO. 24: 

MSWAARPPFLPQRHAAGQCGPVGVRKEMHCGVASRWRRRRPWLDPAAAAAAAV 
AGGEQQTPEPEPGEAGRDGMGDSGRDSRSPDSSSPNPLPQGVPPPSPPGPPLP 
PSTAPTLGGSGAPPPPPMPPPPLGSPFPVISSSMGSPGLPPPAPPGFSGPVSS 
PQINSTVSLPGGGSGPPEDVKPPVLGVRGLHCPPPPGGPGAGKRLCAICGDRS. 
SGKHYGWSCEGCKGFFKRTIRKDLTYSCRDFKDCTVDKRQRNRCQYCRYQKC 
LATGMKREAVQEERQRGKDKDGDGEGAGGAPEEMPVDRILEAELAVEQKSDQG 
VEGPGGTGGSGSSPNDPVTNICQAADKQLFTLVEWAKRIPHFSSLPLDDQVIL 
LRAGWNELLIASFSHRSIDVRDGILLATGLHVHRNSAHSAGVGAIFDRVLTEL 
VSKMRDMRMDKTELGCLRAIILFNPDAKGLSNPSEVEVLREKVYASLETYCKQ 
KYPEQQGRFAKLLLRLPALRSIGLKCLEHLFFFKLIGDTPIDTFLMEMLEAPH 
QLA 
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RXRqamma (ORR: 

SEQ ID NO. 25: 

ATGTATGGAAATtATTCTCACTTCATGAAGTTTCCCGCAGGCTATGGAGGCTC 

CCCTGGCCACACTGGCTCTACATCCATGAGCCCATCAGCAGCCTTGTCCACAG 

GGAAGCCAATGGACAGCCACCCCAGCTACACAGATACCCGAGTGAGTGCCCCA 

CGGACTCTGAGTGCAGTGGGGACCCCCCTCAATGCCCTGGGCTCTCCATATCG 

AGT(^TCACCTCTGCCATGGGCCCACCCTCAGGAGCACTTGCAGCGCCTCCAa. 

GAATCAACTTGGTTGCCCCACCCAGCTCTCAGCTAAATGTGGTCAACAGTGTC. 

AGCAGTTCAGAGGACATCAAGCCCTTACCAGGGCTTCCCGGGATTGGAAACAT 

GAACTACCCATCCACCAGCCCCGGATCTCTGGTTAAACACATCTGTGCTATCT 

GTGGAGACAGATGCTCAGGAAAGCACTACGGGGTATACAGTTGTGAAGGCTGC 

AAAGGGTTCTTCAAGAGGACGATAAGGAAGGACCTCATCTACACGTGTCGGGA 

TAATAAAGACTGCCTCATTGACAAGCGTCAGCGCAACCGCTGCCAGTACTGTC 

GCTATCAGAAGTGCCTTGTCATGGGCATGAAGAGGGAAGCTGTGCAAGAAGAA 

AGACAGAGGAGCCGAGAGCGAGCTGAGAGTGAGGCAGAATGTGCTACGAGTGG 

TCATGAAGACATGCCTGTGGAGAGGATTCTAGAAGCTGAACTTGCTGTTGAAC , 

CAAAGACAGAATCCTATGGTGACATGAATATGGAGAACTCGACAAATGACCCT 

GTTACCAACATATGTCATGCTGCTGACAAGCAGCTTTTCACCCTCGTTGAATG 

GGCCAAGCGTATTCCCCACTTCTCTGACCTCACCTTGGAGGACCAGGTCATTT 

TGCTTCGGGCAGGGTGGAAfGAATTGCTGATTGCCTCTTTCTCCCACGGCTCA 

GTTTCCGTGCAGGATGGCATCCTTCTGGCCACGGGTTTACATGTCCACCGGAG 

CAGTGCCCACAGTGCTGGGGTCGGCTCCATCTTTGACAGAGTTCTAACTGAGC 

TGGTTTCCAAAATGAAAGACATGCAGATGGACAAGTCGGAACTGGGATGCCTG 

CGAGCCATTGTACTCTTTAACCCAGATGCCAAGGGCCTGTCCAACCCCTCTGA 

GGTGGAGACTCTGCGAGAGAAGGTTTATGCCACCCTTGAGGCGTACACCAAGC 

AGAAGTATCGGGAACAGCCAGGCAGGTTTGCCAAGCTGCTGGTGCGCCTCCCA 

GCTCTGCGTTCCATTGGCTTGAAATGCCTGGAGCACCTCTTCTTCTTCAAGCT 

CATCGGGGACACCCCCATTGACACCTTCCTCATGGAGATGTTGGAGACCCCGC 

TGCAGATCACCTGA 

RXRgamma reverse complement: 
SEQ ID NO. 26: 

TCAGGTGATCTGCAGCGGGGTCTCCAACATCTCCATGAGGAAGGTGTCAATGG 
GGGTGTCCCCGATGAGCTTGAAGAAGAAGAGGTGCTCCIAGGCATTTCAAGCCA 
ATGGAACGCAGAGCTGGGAGGCGCAGCAGCAGCTTGGCAAACCTGCCTGGCTG 
TTCCGGATACTTCTGCTTGGTGTAGGCCTCAAGGGTGGCATAAACCTTCTCTG 
GCAGAGTCTCCACCTCAGAGGGGTTGGACAGGCCCTTGGCATCTGGGTTAAAG 
AGTACAATGGCTCGCAGGCATCCCAGTTCCGACTTGTCCATCTGCATGTCTTT 
CATTTTGGAAACCAGCTC AGTTAGAACTCTGTCAAAGATGGAGCCGAC CCCAG 
CACTGTGGGCACTGCTCCGGTGGACATGTAAACCCGTGGCCAGAAGGATGCCA 
TCGTGCACGGAAACTGAGCGGTGGGAGAAAGAGGCAATCAGCAATTCATTCCA 
CCCTGCCCGAAGCAAAATGACCTGGTCCTCCAAGGTGAGGTCAGAGAAGTGGG 
GAATACGCTTGGCCGATTCAACGAGGGTGAAAAGGTGCTTGTCAGCAGCATGA 
CATATGTTGGTAACAGGGTCATTTGTCGAGTTCTCCATATTCATGTCACCATA 
GGATTCTGTCTTTGGTTCAACAGCAAGTTCAGCTTCTAGAATCCTCTCCACAG 
GCATGTCTTCATGACCACTGGTAGCACATTCTGCCTCACTCTCAGCTCGCTCT 
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CGGCTCCTCTGTCTTTCTTCTTGCACAGCTTCCCTCTTCATGCCCATGACAAG 
GCACTTCTGATAGCGACAGTACTGGCAGCGGTTGCGCTGACGCTTGTCAATGA 
GGCAGTCTTTATTATCCCGACACGTGTAGATGAGGTCCTTCCTTATCGTCCTC 
TTGAAGAACCCTTTGCAGCCTTCACAACTGTATACCCCGTAGTGCTTTCCTGA 
GGATCTGTCTCCACAGATAGCACAGATGTGTTTAACCAGAGATCCGGGGCTGG 
TGGATGGGTAGTTCATGTTTCCAATCCCGGGAAGCCCTGGTAAGGGCTTGATG 
TCCTCTGAACTGCTGACACTGTTGACCACATTTAGCTGAGAGCTGGGTGGGGC 
AACCAAGTTGATTCCTGGAGGCGCTGCAAGTGCTCCTGAGGGTGGGCCCATGG 
CAGAGGTGATGACTCGATATGGAGAGCCCAGGGCATTGAGGGGGGTCCCCACT 
GCACTCAGAGTCCGTGGGGCACTCACTGGGGTATCTGTGTAGCTGGGGTGGGT 
GTCCATTGGCTTCCCTGTGGACAAGGCTGCTGATGGGCTCATGGATGTAGAGC 
CAGTGTGGCCAGGGGAGCCTCCATAGCCTGCGGGAAACTTCATGAAGTGAGAA 
TAATTTCCATACAT 

RXRgamma-Protein: 
SEQ ID NO. 27: 

MYGNYSHFMKFPAGYGGSPGHTGSTSMSPSAALSTGKPMDSHPSYTDTPVSAP 

RTLSAVGTPKNALGSPYRVITSAMGPPSGALAAPPGINLVAPPSSQLNVVNSV 

SSSEDIKPLPGLPGIGNl^SryPSTSPGSLViailCAICGDRSSGKHYGVYSCEG 

KGFFKRTIRKDLIYTCRDNKDCLIDKRQR]^CQYCRYQKCLVMGMKREAVQEE 

RQRSRERAESEAEGATSGHEDMPVERILEAELAVEPKTESYGDMNMENSTNDP 

VTNICHAADKQLFTLVEWAKRI PHFSDLTLEDQVILLRAGWNELLIASFSHRS 

VSVQDGILLATGLHVHRSSAHSAGVGSIFDRVLTELVSKMKDMQMDKSELGCL 

RAIVLFNPDAKGLSNPSEVETLREKVYATLEAYTKQKYPEQPGRFAKLLLRLP 

ALRS IGLKCLEHLFFFKLIGDTPIDTFLMEMLETPLQIT 
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CF44: 

CDNA Sequence 
SEQ ID NO. 28: 

GACTCCCAAGATGGCGGACCTACTGGGCTCCATCCTGAGCTCGATGGAGAAGC 
CACCCAGCCTCGGTGAGCAGGAGACTCGGCGCAAGGCCCGAGAACAGGCCGCC 
CGCCTGAAGAAACTAC^GAGCAAGAGAAACAACAGAAAGTGGAGTTTCGTAA 
AAGGATGGAGAAGGAGGTGTCAGATTTCATTCAAGACAGTGGGCAGATCAAGA 
AAAAGTTTCAGCCAATGAACAAGATCGAGAGGAGCATACTACATGATGTGGTG 
GAAGTGGCTGGCCTGACATCCTTCTCCTTTGGGGAAGATGATGACTGTCGCTA 
TGTCATGATCTTCAAAAAGGAGTTTGCACCCTCAGATGAAGAGCTAGACTCTT 
ACCGTCGTGGAGAGGAATGGGACCCCCAGAAGGCTGAGGAGAAGCGGAAGCTG 
AAGGAGCTGGCCCAGAGGCAAGAGGAGGAGGCAGCCCAGCAGGGGCCTGTGGT 
•GGTGAGCCCTGCCAGCGACTAC!AAGGACAAGTACAGCCACCTCATCGGCAAGG 
GAGCAGCCAAAGACGCAGCCCACATGCTACAGGCCAATAAGACCTACGGCTGT 
GTGCGCGTGGCCAATAAGAGGGAGACACGCTCCATTGAAGAGGCTATGAATGA 
GATCAGAGCCAAGAAGCGTCTGCGGCAGAGTGGGGAAGAGTTGCCGCCAACCT 
CTAGGCGCGCCGCCCAGCTCCCTTTGACCCCTGGGGCAGGGCAGGGGGCAGGG 
AGAGACAAGGCTGCTGCTATTAGAGCCCATCCTGGAGCCCCACCTCTGAACCA 
CCTCCTACCAGCTGTCCCTCAGGCTGGGGGAAAACAGGTGTTTGATTTGTCAC 
CGTTGGAGCTTGGATATGTGCGTGGCATGTGTGTGTGTGTGTGAGAGTGTGAA 

tgcacaggtgggtatttaatctgtattattccccgttcttggaattttcttcc 

CCATGGGGCTGGGGTACTTTACATTCAATAAATACTGTTTAACCCAAAAAAAA 
AAAAAAAAAAAAAAAAAAAA 

SEQ ID NO. 29: 
Reverse complement: 

TTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGTTAAACAGTATTTATTGAATGT 
AAAGTACCCCAGCCCCATGGGGAAGAAAATTCCAAGAACGGGGAATAATACAG 
ATTAAATACCCACCTGTGCATTCACACTCTCACACACACACACACATGCCACG 
CACATATCCAAGCTCCAACGGTGACAAATCAAACAGCTGTTTTCCCCCAGCCT 
GAGGGACAGCTGGTAGGAGGTGGTTCAGAGGTGGGGCTCCAGGATGGGCTCTA 
ATAGCAGCAGCCTTGTCTCTCCCTGCCCCCTGCCCTGCCCCAGGGGTCAAAGG 
GAGCTGGGCGGGGCGCCTAGAGGTTGGCGGCAACTCTTCCCCACTCTGCCGCA 
GACGCTTCTTGGCTCTGATCTCATTGATAGCCTCTTCAATGGAGCGTGTGTCC 
CTCTTATTGGCCACGGGCACACAGCCGTAGGTCTTATTGGCCTGTAGGATGTG 
GGCTGCGTCTTTGGCTGCTCCCTTGCCGATGAGGTGGCTGTACTTGTCCTTGT 
AGTCGCTGGCAGGGCTCACCACCACAGGCCCCTGCTGGGCTGCCTGCTCCTCT 
TGCCTCTGGGCCAGCTCCTTCAGCTTCCGCTTCTCCTCAGCCTTCfGGGGGTC 
CCATTCCTCTCCACGACGGTAAGAGTCTAGCTCTTCATCTGAGGGTGCAAACT 
CCTTTTTGAAGATCATGACATAGCGACAGTCATCATCTTCCCCAAAGGAGAAG 
GATGTCAGGCGAGCCACTTCCACCACATCATGTAGTATGCTCCTCTCGATCTT 
GTTCATTGGCTGAAACTTTTTCTTGATCTGCCCACTGTCTTGAATGAAATCTG 
ACACCTCCTTCTCCATCCTTTTACGAAACTCCACTTTCTGTTGTTTCTCTTGC 
TCTTGTAGTTTCTTCAGGCGGGCGGCCTGTTCTCGGGCCTTGCGCCGAGTCTC 
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CTGGTCACCGAGGCTGGGTGGCTTCTCCATGGAGCTCAGGATGGAGGCCAGTA 
GGTCCGCCATCTTGGGAGTC 

SEQ ID NO. 30: 

PROTEIN 

MADLLGS ILS iSMEKPPSLGDQETRRKAREQAARLKKLQEQEKQQKYEFRKRME 
KEVSDF I QDSGQ IKKKFQPMNKI ERS ILHDWEVAGLTS FS FGEDDDCRYVMI 
FKKEFAPSDEELDSYRRGEEWDPQKAEEKRKLKELAQRQEEEAAQQGPVVVSP 
ASDYKDKYSHLIGKGAAKDAAH^QANKTYGWPVANKRDTRSIEEAMNEIRA 
KKRLRQSGEELPPTSRRPAQLPLTPGAGQGAGRDKAAAIRAHPGAPPLNHLLP 
AVPQAGGKQVFDLS PLELGYVRGMCVCV 
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SEQUENCE LISTING 

<110> LION Bioscience AG 

<120> Novel Cof actors of the Pregnane x Receptor and Methods of „ Use 
<130> L-0017-01-WO-01 
<160> 30 

<170> Patentln version 3.0 

<210> 1 

<211> 675 

<212> DNA 

<213> Homo sapiens 

<400> 1 

cttgctctgg ctgtttctgc ccctgggtta acattcaaga tggtacatgc tgaagccttt 60 

tctcgtcctt tgagtcggaa tgaagttgtt ggtttaattt tccgtttgac aatatttggt 120 

gcagtgacat actttactat caaatggatg -gtagatgcaa ttgatccaac cagaaagcaa 180 

aaagtagaag ctcagaaaca ggcagaaaaa ctaatgaagc aaattggagt gaaaaatgtg 240 

aagctctcaa aatatgaaat gagtattgct gctcatcttg tagaccctct taatatgcat 300 

gttacttgga gtgatatagc aggtttagat gatgtcatta cggatctgaa agacacagtc 360 

atcttaccta tcaaaaagaa acatttgttt gagaattcca ggcttctgca gcctccaaaa 420 

ggtgttcttc tctatgggcc tccaggctgg ggtaaaacgt tgattgccaa ggccacagcc 480 

aaagaagcag gctgtccatt tattaacctt cagccttcga cactgaccga taagtggtat 540 

ggagaatctc agaaattggc tgctgctgtc ttatcccttg ccataaagct acaaccatcc 600 

atcatcttta tagatggaaa tagactcctt ttttacgaaa ccgttcaagt tctgaccatg 660 

aaagctacag cccat 675 

\ 

<210> 2 

<211> 675 

<212> DNA 

<213> Homo sapiens 

<400> 2 

atgggctgta gctttcatgg tcagaacttg aacggtttcg taaaaaagga gtctatttcc 60 

atctataaag atgatggatg gttgtagctt tatggcaagg gataagacag cagcagccaa 120 

tttctgagat tctccatacc acttatcggt cagtgtcgaa ggctgaaggt taataaatgg 180 

acagcctgct tctttggctg tggccttggc aatcaacgtt ttaccccagc ctggaggccc 240 

atagagaaga. acaccttttg gaggctgcag aagcctggaa ttctcaaaca aatgtttctt 300 

tttgataggt aagatgactg tgtctttcag atccgtaatg acatcatcta aacctgctat 360 

atcactccaa gtaacatgca tattaagagg gtctacaaga tgagcagcaa tactcatttc 420 
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atattttgag agcttcacat ttttcactcc aatttgcttc attagttttt ctgcctgttt 480 

ctgagcttct actttttgct ttctggttgg atcaattgca tctaccatcc atttgatagt 540 

aaagtatgtc actgcaccaa atattgtcaa acggaaaatt aaaccaacaa cttcattccg 600 

actcaaagga cgagaaaagg cttcagcatg taccatcttg aatgttaacc caggggcaga 660 

aacagccaga gcaag 675 



<210> 3 

<211> 225 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Leu Ala Leu Ala Val Ser Ala Pro Gly Leu Thr Phe Lys Met Val His 
1 5 10 15 

Ala Glu Ala Phe Ser Arg Pro Leu Ser Arg Asn Glu Val Val Gly Leu 
20 25 30 

He. Phe Arg Leu Thr He Phe Gly Ala Val Thr Tyr Phe Thr He Lys 
35 40 45 

Trp Met Val Asp Ala He Asp Pro Thr Arg Lys Gin Lys Val Glu Ala 
50 55 .60 

Gin Lys Gin Ala Glu Lys Leu Met Lys Gin He Gly Val Lys Asn Val 
65 70 75 80 

Lys Leu Ser Lys Tyr Glu Met Ser He Ala Ala His Leu Val Asp Pro 
85 90 95 

Leu Asn Met His Val Thr Trp Ser Asp He Ala Gly Leu Asp Asp Val 
100 105 110 

He Thr Asp Leu Lys Asp Thr Val He Leu Pro He Lys Lys Lys His 
115 120 125 

Leu Phe Glu Asn Ser Arg Leu Leu Gin Pro Pro Lys Gly Val Leu Leu 
130 135 140 

Tyr Gly Pro Pro Gly Trp Gly Lys Thr Leu He Ala Lys Ala Thr Ala 
145 150 155 160 

Lys Glu Ala Gly Cys Pro Phe He Asn Leu Gin Pro Ser Thr Leu Thr 
165 170 175 

Asp Lys Trp Tyr Gly Glu Ser Gin Lys Leu Ala Ala Ala Val Leu Ser 
180 185 190 

Leu Ala He Lys Leu Gin Pro Ser He He Phe He Asp Gly Asn Arg 
195 200 205 

Leu Leu Phe Tyr Glu Thr Val Gin Val Leu Thr Met Lys Ala Thr Ala 
210 215 220 

His 
225 
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<210> 4 

<211> 728 

<212> DNA 

<213> Homo sapiens 



<400> 4 



gatttagtaa 


gtcaccatgt 


gaggacaaaa cttgatgaac 


tgaaaaggca agaagtagga 


60 


aggttaagaa 


tgttaattaa 


agctaagttg gattcccttc 


aagatatagg catggaccac 


120 


caagctcttc 


taaaacaatt 


tgatcaccta aaccacctga 


atcctgacaa gtttgaatcc 


180 


acagatttag 


atatgetaat 


caaageggea acaagtgatc 


tggaacacta tgacaagact 


240 


cgtcatgaag 


aatttaaaaa 


atatgaaatg atgaaggaac 


atgaaaggag agaatattta 


300 


aaaacattga " 


atgaagaaaa 


gagaaaagaa gaagagtcta 


aat 1 t gaaga aa t gaagaaa 


.360 


aagcatgaaa 


atcaccctaa 


agttaatcac ccaggaagca 


aagatcaact aaaagaggta 


420 


tgggaagaga 


ctgatggatt 


ggatcctaat gactttgacc 


ccaagacatt tttcaaatta 


480 


catgatgtca 


atagtgatgg 


attcctggat gaacaagaat 


tagaagcect atttactaaa 


540 


gagttggaga 


aagtttatga 


ccctaaaaat gaaaaggatg 


atatggtaga aatggaagaa 


600 


gaaaggctta 


aaatgaggga 


acatgtaatg aatgaggttg 


atactaacaa agacagattg 


660 


gtgactcttg 


gaggagtttt 


tgaaagecac agaaaaaaaa 


agaatttttg gageccagat 


720 


agctggga 








728 


<210> ,5 

<211> 728 

<212> DNA 

<213> Homo sapiens 








<400> 5 
tcccagctat 


ctgggctcca 


aaaattcttt ttttttctgt 


ggctttcaaa aactcctcca 


60 


agagt caeca 


atctgtcttt 


gttagtatca acctcattca 


ttacatgttc cctcatttta 


120 


agectttett 


cttccatttc 


taccatatca tccttttcafc 


ttttagggtc ataaactttc 


ion 


tccaactctt 


tagtaaatag 


ggcttctaat tcttgttcat 


ccaggaatcc atcactattg 


*1 A ft 


acatcatgta 


atttgaaaaa 


tgtcttgggg tcaaagtcat 


t agga t c caa tec at cagt c 


300 


tcttcccata 


cctcttttag 


ttgatctttg cttcctgggt 


gattaacttt agggtgattt 


360 


teatgetttt 


tcttcatttc 


ttcaaattta gactcttctt 


cttttctctt ttcttcattc 


420 


aatgttttta 


aatattctct 


cctttcatgt tccttcatca 


tttcatattt tttaaattct 


480 


tcatgacgag 


tcttgtcata 


gtgttccaga tcacttgttg 


cegctttgat tagcatatct 


540 


aaatctgtgg 


attcaaactt 


gtcaggattc aggtggttta 


ggtgatcaaa ttgttttaga 


600 


agagcttggt 


ggtccatgcc 


tatatcttga agggaatcca 


acttagcttt aattaacatt 


660 


cttaaccttc 


ctacttcttg 


ccttttcagt tcatcaagtt 


ttgtcctcac atggtgactt 


720 
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<210> 6 

<211> 242 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Asp Leu Val Ser His His Val Arg Thr Lys Leu Asp Glu Leu Lys Arg 
1 5 10 15 

Gin Glu Val Gly Arg Leu Arg Met Leu He Lys Ala Lys Leu Asp. Ser 
20 , 25 30 

Leu Gin Asp lie Gly Met Asp His Gin Ala Leu Leu Lys Gin Phe Asp 
35 40 45 

His Leu Asn His Leu Asn Pro Asp Lys Phe Glu Ser Thr Asp Leu Asp 
50 55 60 

Met Leu He Lys Ala Ala Thr Ser Asp Leu Glu His Tyr Asp Lys Thr 
65' 70 75 BO 

Arg His Glu Glu Phe Lys Lys Tyr Glu Met Met Lys Glu His Glu Arg 
85 90 95 

Arg Glu Tyr Leu Lys Thr Leu Asn Glu Glu Lys Arg Lys Glu Glu Glu 
100 , ~ 105 110 

Ser Lys Phe Glu Glu Met Lys Lys Lys His Glu Asn His Pro Lys Val 
115 120 125 

Asn His Pro Gly Ser Lys Asp Gin Leu Lys Glu Val Trp Glu Glu Thr 
130 135 140 

Asp Gly Leu Asp Pro Asn Asp Phe Asp Pro Lys Thr Phe Phe Lys Leu 
145 150 155 160 

His Asp Val Asn Ser Asp Gly Phe Leu Asp Glu Gin Glu Leu Glu Ala 
165 170 175 

Leu Phe Thr Lys Glu Leu Glu Lys Val Tyr Asp Pro Lys Asn Glu Lys 
180 185 190 

Asp Asp Met Val Glu Met Glu Glu Glu Arg Leu Lys Met Arg Glu His 
195 200 205 

Val Met Asn Glu Val Asp Thr Asn Lys Asp Arg Leu Val Thr Leu Gly 
210 215 220 

Gly Val Phe Glu Ser His Arg Lys Lys Lys Asn Phe Trp Ser Pro Asp 
225 230 235 240 

Ser Trp 



<210> 7 

<211> 660 

<212> DNA 

<213> Homo sapiens 
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<400> 7 
tfcfcgtcagac 


accacgtccg 


cacaaagctg 


gatgagctca 


agcgacagga ggtgtcacgg 


60 


c t g c gg a t g c 


tgctcaaggc 


caagatggac 


gccgagcagg 


atcccaatgt acaggtggat 


120 


catctgaatc 


tcctgaaaca gtttgaacac 


ctggaccctc 


agaaccagca tacattcgag 


180 


gcccgcgacc 


tggagctgct 


gatccagacg 


gccacccggg 


accttgccca gtacgacgca 


240 


acccatcatg 


aaaagttcaa ^gcgctacgag 


atgcttaagg 


aacacgagag acggcgttat 


300 


ctggagtcac 


tgggagagga 


gcagagaaag 


gaggcggaga 


ggaagctgga agagcaacag 


360 


cgccggcacc 


gcgagcaccc 


taaagtcaac 


gtgcctggca 


gccaagccca gttgaaggag 


420 


gtgtgggagg 


agctggatgg 


actggacccc 


aacaggttta 


■ accccaagac^cttcttcata 


ABO 


ctgcatgata 


tcaacagtga 


tggtgtcctg 


gatgagcagg 


agctggaggc actcttcacc 


540 


aaggagctgg 


agaaagtgta 


cgacccaaag 


aatgaggagg 


acgacatgcg ggagatggag 


600 


gaggagcgac 


tgcgcatgct 


gaagcatgtg 


atgaagaatg 


tggacaccca accaggaccg 


660 


<210> 8 . 
<211> 660 
<212> DNA 
<213> Homo sapiens 










<400> 8 
cggtcctggt 


tgggtgtcca 


cattcttcat 


cacatgcttc 


agcatgcgca gtcgctcctc 


60 


ctccatctcc 


cgcatgtcgt 


cctcctcatt 


ctttgggtcg 


tacactttct ccagctcctt 


120 


ggtgaagagt 


gcctccagct 


cctgctcatc 


caggacacca 


tcactgttga tatcatgcag 


180 


tatgaagaag 


gtcttggggt 


taaacctgtt 


ggggtccagt 


ccatccagct cctcccacac 


240 


ctccttcaac 


tgggcttggc 


tgccaggcac 


gttgacttta 


gggtgctcgc ggtgccggcg 


300 


ctgttgctct 


tccagcttcc 


tctccgcctc 


ctttctctgc 


tcctctccca gtgactccag 


360 


ataacgccgt 


ctctcgtgtt 


ccttaagcat 


ctcgtagcgc 


ttgaactttt catgatgggt 


420 


tgcgtcgtac 


tgggcaaggt 


cccgggtggc 


cgtctggatc 


agcagctcca ggtcgcgggc 


480 


ctcgaatgta 


tgctggttct 


gagggtccag 


gtgttcaaac 


tgtttcagga gattcagatg 


540 


atccacctgt 


acattgggat 


cctgctcggc 


gtccatcttg 


gccttgagca gcatccgcag 


600 


ccgtgacacc 


tcctgtcgct 


tgagctcatc 


cagctttgtg 


cggacgtggt gtctgacaaa 


660 


<210> 9 

<211>* 220 

<212> PRT 

<213> Homo sapiens 










<400> 9 












Phe Val Arg His His Val Arg Thr Lys Leu Asp 
1 5 10 


Glu Leu Lys Arg Gin 
15 
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Glu Val Ser Arg Leu Arg Met Leu Leu Lys Ala Lys Met Asp Ala Glu 
20 25 30 

Gin Asp Pro Asn Val Gin Val Asp His Leu Asn Leu Leu Lys Gin Phe 
35 40 45 

Glu His Leu Asp Pro Gin Asn Gin His Thr Phe Glu Ala Arg Asp Leu 
50 55 60 

Glu Leu Leu lie Gin Thr Ala Thr Arg Asp Leu Ala Gin Tyr Asp Ala 
65 70 75 80 

Thr His His Glu Lys Phe Lys Arg Tyr Glu Met Leu Lys Glu His Glu 
85 90 • 95 

Arg Arg Arg Tyr Leu Glu Ser Leu Gly Glu Glu Gin Arg ., Lys Glu Ala 
100 105 110 J 

Glu Arg Lys' Leu Glu Glu Gin Gin Arg Arg His Arg Glu His Pro Lys 
115 120 125 

Val Asn Val Pro Gly Ser Gin Ala Gin Leu Lys Glu Val Trp Glu Glu 
13 0 135 140 

Leu Asp Gly Leu Asp Pro Asn Arg Phe Asn Pro Lys Thr Phe Phe lie 
145 150 155 160 

Leu His Asp lie Asn Ser Asp Gly Val Leu Asp Glu Gin Glu Leu Glu 
165 170 175 

Ala Leu Phe Thr Lys Glu Leu Glu Lys Val Tyr Asp Pro Lys Asn Glu 
180 185 190 

Glu Asp Asp Met Arg Glu Met Glu Glu Glu Arg Leu Arg Met Leu Lys 
195 200 . 205 

His Val Met Lys Asn Val Asp Thr Gin Pro Gly Pro 
210 215 220 

<210> 10 

<211> 670 

<212> DNA 

<213> Homo sapiens v 

<400> 10 



ggggactcgg 


ccctgaacga 


gcaggagaag gagttgcagc ggcggctgaa 


gcgtctctac 


60 


ccggccgagg 


acgaacaaga 


gacgccgctg cctaggtcct ggagcccgaa 


ggacaagttc 


120 


agctacatcg 


gcctctctca 


gaacaacctg cgggtgcact acaaaggtca 


tggcaaaacc 


180 


ccaaaagatg 


ccgcgtcagt 


tcgagccacg catccaatac cagcagcctg 


tgggatttat 


240 


tattttgaag 


taaaaattgt 


cagtaaggga agagatggtt acatgggaat 


tggtctttct 


300 


gctcaaggtg 


tgaacatgaa 


tagactacca ggttgggata agcattcata 


tggttaccat 


360 


ggggatgatg 


gacattcgtt 


ttgttcttct ggaactggac aaccttatgg 


accaactttc 


420 


actactggtg 


atgtcattgg 


ctgttgtgtt aatcttatca acaatacctg 


cttttacacc 


480 


aagaatggac 


atagtttagg 


tattgcttta ctgacctacc gccaaatttg 


tatcctactg 


540 
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tggggcttca aacaccagga gaagtggtcg atgccaattt ttgggcaaca tcctttccgt 600 
gtttgatata aaaaactata tgccgggagt ggagaaccaa aatccaggcc ccagatagat 660 
ccgatttcct 670 



<210> 11 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<400> 11 



aggaaatcgg 


atctatctgg 


ggcctggatt ttggttctcc 


actcccggca 


tatagttttt 


60 


t at at c aaac 


accrcraaacrcra 


tgttgcccaa ^aaattggcat 


cgaccacttc : 


tcctggtgtt 


120 


tgaagcccca 


cagtaggata 


caaatttggc ggtaggtcag 


taaagcaata 


cctaaactat 


180 


gtccattctt 


ggtgtaaaag 


caggtattgt tgataagatt 


aacacaacag 


ccaatgacat 


240 


caccagtagt 


gaaagttggt 


ccataaggtt gtccagttcc 


agaagaacaa 


aacgaatgtc 


300 


catcatcccc 


atggtaacca 


tatgaatgct tatcccaacc 


tggtagtcta ' 


ttcatgttca 


360 


caccttgagc 


agaaagacca 


attcccatgt aaccatctct 


tcccttactg 


acaattttta 


420 


cttcaaaata 


ataaatccca 


caggctgctg gtattggatg 


cgtggctcga 


actgacgcgg 


480 


catcttttgg 


ggttttgcca 


tgacctttgt agtgcacccg 


caggttgttc 


tgagagaggc 


540 


cgatgtagct 


gaacttgtcc 


ttcgggctcc aggacctagg 


cagcggcgtc 


tcttgttcgt 


600 


cctcggccgg 


gtagagacgc 


ttcagccgcc gctgcaactc 


cttctcctgc 


tcgttcaggg 


660 


ccgagtcccc 










670 



<210> 12 

<211> 201 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Gly Asp Ser Ala Leu Asn Glu Gin Glu Lys Glu Leu Gin Arg Arg Leu 
15 10 15 

Lys Arg Leu Tyr Pro Ala Glu Asp Glu Gin Glu Thr Pro Leu Pro Arg 
20 25 30 

Ser Trp Ser Pro Lys Asp Lys Phe Ser Tyr He Gly Leu Ser Gin Asn 
35 40 45 

Asn Leu Arg Val His Tyr Lys Gly His Gly Lys Thr Pro Lys Asp Ala 
50 55 60 

I 

Ala Ser Val Arg Ala Thr His Pro He Pro Ala Ala Cys Gly He Tyr 
65 70 75 80 

Tyr Phe Glu Val Lys He Val Ser Lys Gly Arg Asp Gly Tyr Met Gly 
85 90 95 
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He Gly Leu Ser Ala Gin Gly Val Asn Met Asn Arg Leu Pro Gly Trp 
100 105 110 

Asp Lys His Ser Tyr Gly Tyr His Gly Asp Asp Gly His Ser Phe Cys 
115 120 125 

Ser Ser Gly Thr Gly Gin Pro Tyr Gly Pro Thr Phe Thr Thr Gly Asp 
130 135 140 

Val He Gly Cys Cys Val Asn Leu He Asn Asn Thr Cys Phe Tyr Thr 
145 150 155 160 

Lys Asn Gly His Ser Leu Gly He Ala Leu Leu Thr Tyr Arg Gin He 
165 170 175 

Cys lie Leu Leu Trp Gly Phe Lys His Gin Glu Lys Trp .-Ser Met Pro 
180 185 190 

He Phe Gly Gin His Pro Phe Arg Val 
195 200 



<210> 13 

<211> 1140 

<212> DMA 

<213> Homo sapiens 








<400> 13 
a t* a a c a t* cr t~ a 


a acrcra t crc aa 


gggctttttc 


aggagggcca tgaaacgcaa, cgcccggctg 


60 


aggtgcccct 


tccggaaggg 


cgcctgcgag 


atcacccgga agacccggcg acagtgccag 


120 


gcctgccgcc 


tgcgcaagtg 


cctggagagc 


ggcatgaaga aggagatgat catgtccgac 


180 


gaggccgtgg 


aggagaggcg 


ggc c 1 1 gat c 


aagcggaaga aaagtgaacg gacagggact 


240 


cagccactgg 


gagtgcaggg 


gctgacagag 


gagcagcgga tgatgatcag ggagctgatg 


300 


gacgctcaga 


tgaaaacctt 


tgacactacc 


ttctcccatt tcaagaattt ccggctgcca 


360 


ggggtgctta 


gcagtggctg 


cgagttgcca 


gagtctctgc aggccccatc gagggaagaa 


420 


gctgccaagt 


ggagccaggt 


ccggaaagat 


ctgtgctctt tgaaggtctc tctgcagctg 


480 


cggggggagg 


atggcagtgt 


ctggaactac 


aaacccccag ccgacagtgg cgggaaagag 


540 


atcttctccc 


tgctgcccca 


catggctgac 


atgtcaacct acatgttcaa aggcatcatc 


600 


agctttgcca 


aagtcatctc 


ctacttcagg 


gacttgccca tcgaggacca gatctccctg 


660 


ctgaaggggg 


ccgctttcga 


gctgtgtcaa 


ctgagattca acacagtgtt caacgcggag 


720 


actggaacct 


gggagtgtgg 


ccggctgtcc 


tactgcttgg aagacactgc aggtggcttc 


780 


cagcaacttc 


tactggagcc 


catgctgaaa 


ttccactaca tgctgaagaa gctgcagctg 


840 


catgaggagg 


agtatgtgct 


gatgcaggcc 


atctccctct tctccccaga ccgcccaggt 


900 


gtgctgcagc 


accgcgtggt 


ggaccagctg 


caggagcaat tcgccattac tctgaagtcc 


960 


tacattgaat 


gcaatcggcc 


ccagcctgct 


cataggttct tgttcctgaa gatcatggct 


1020 


atgctcaccg 


agctccgcag 


catcaatgct 


cagcacaccc agcggctgct gcgcatccag 


1080 
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<210> < 14 

<211> 1140 

<212> DNA 

<213> Homo sapiens 

<400> 14 

agtcgatgga cactacggct tgttgaggac gtactccccg catcgtttcc ccacatacag 60 

, gacctacgcg tcgtcggcga cccacacgac tcgtaactac gacgcctcga :gccactcgta 120 

tcggtactag aagtccttgt tcttggatac tcgtccgacc ccggctaacg taagttacat 180 

, cctgaagtct .cattaccgct taacgaggac gtcgaccagg tggtgcgcca cgacgtcgtg 240 

tggacccgcc agacccctct tctccctcta ccggacgtag tcgtgtatga ggaggagtac 300 

gtcgacgtcg aagaagtcgt acatcacctt aaagtcgtac ccgaggtcat cttcaacgac 360 

cttcggtgga cgtcacagaa ggttcgtcat cctgtcggcc ggtgtgaggg tccaaggtca 420 

gaggcgcaac ttgtgacaca acttagagtc aactgtgtcg agctttcgcc gggggaagtc 480 

gtccctctag accaggagct- acccgttcag ggacttcatc ctctactgaa accgtttcga 540 

ctactacgga aacttgtaca tccaactgta cagtcggtac accccgtcgt ccctcttcta 600 

gagaaagggc , ggtgacagcc gacccccaaa catcaaggtc tgtgacggta ggaggggggc 660 

gtcgacgtct ctctggaagt ttctcgtgtc tagaaaggcc tggaccgagg tgaaccgtcg 720 

aagaagggag, ctaccccgga cgtctctgag accgttgagc gtcggtgacg attcgtgggg 780 

accgtcggcc tttaagaact ttaccctctt ccatcacagt ttccaaaagt agactcgcag 840 

gtagtcgagg gactagtagt aggcgacgag gagacagtcg gggacgtgag ggtcaccgac 900 

tcagggacag gcaagtgaaa agaaggcgaa ctagttccgg gcggagagga ggtgccggag 960 

cagcctgtac tagtagagga agaagtacgg cgagaggtcc gtgaacgcgt ccgccgtccg 1020 

gaccgtgaca gcggcccaga aggcccacta gagcgtccgc gggaaggcct tccccgtgga 1080 

gtcggcccgc aacgcaaagt accgggagga ctttttcggg aacgtaggaa gtgtacagta 1140 

<210> 15 

<211> 434 

<212> PRT 

<213> Homo sapiens 

<400> • 15 

Met Glu Val Arg Pro Lys Glu Ser Trp Asn His Ala Asp Phe Val His 
15 10 15 

Cys Glu Asp Thr Glu Ser Val Pro Gly Lys Pro Ser Val Asn Ala Asp 
20 25 30 

Glu Glu Val Gly Gly Pro Gin He Cys Arg Val Cys Gly Asp Lys Ala 
35 40 45 
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Thr Gly Tyr His Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly Phe 
50 55 60 

Phe Arg Arg Ala Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe Arg 
65 70 75 80 

Lys Gly Ala Cys Glu lie Thr Arg Lys Thr Arg Arg Gin Cys Gin Ala 
85 90 95 

Cys Arg Leu Arg Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met lie 
100 105 110 

Met Ser Asp Glu Ala Val Glu Glu Arg Arg Ala Leu lie. Lys Arg Lys 
115 120 125 

Lys Ser Glu Arg Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr 
130 , 135 140 

Glu Glu Gin Arg Met Met He Arg Glu Leu Met Asp Ala Gin Met Lys 
145 150 155 160 

Thr Phe, Asp Thr Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro Gly 
165 170 175 

Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Ser Leu Gin/Ala Pro Ser 
180 185 190 

Arg Glu Glu Ala Ala Lys Trp Ser Gin Val Arg Lys Asp Leu Cys Ser 
195 200 205 



Leu Lys Val Ser Leu 
210 

Tyr Lys Pro Pro Ala 
225 

Pro His Met Ala Asp 
245 

Phe Ala Lys Val He 
260 

He Ser Leu Leu Lys 
275 

Asn Thr Val Phe Asn 
290 

Ser Tyr Cys Leu Glu 
305 

Glu Pro Met Leu Lys 
325 

Glu Glu Glu Tyr Val 
340 

Arg Pro Gly Val Leu 
355 

Phe Ala lie Thr Leu 
370 



Gin Leu Arg Gly Glu Asp 
215 

Asp Ser Gly Gly Lys Glu 
230 235 

Met Ser Thr Tyr Met Phe 
250 

Ser Tyr Phe Arg Asp Leu 
265 

Gly Ala Ala Phe Glu Leu 
280 

Ala Glu Thr Gly Thr Trp 
295 

Asp Thr Ala Gly Gly Phe 
310 315 

Phe His Tyr Met Leu Lys 
330 

Leu Met Gin Ala He Ser 
345 

Gin His Arg Val Val Asp 
360 

Lys Ser Tyr He Glu Cys 
375 



Gly Ser Val Trp Asn 
220 

He Phe Ser Leu Leu 
240 

Lys Gly lie He Ser 
255 

Pro He Glu Asp Gin 
270 

Cys Gin Leu Arg Phe 
285 

Glu Cys Gly Arg Leu 
300 

Gin Gin Leu Leu Leu 
320 

Lys Leu Gin Leu His 
335 

Leu Phe Ser Pro Asp 
350 

Gin Leu Gin Glu Gin 
365 

Asn Arg Pro Gin Pro 
380 
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Ala His Arg Phe Leu Phe Leu Lys He Met Ala Met Leu Thr Glu Leu 
385 390 395 400 

Arg Ser lie Asn Ala Gin His Thr Gin Arg Leu Leu Arg lie Gin Asp 
405 410 415 

He His Pro Phe Ala Thr Pro Leu Met Gin Glu Leu Phe Gly He Thr 
420 425 430 

Gly Ser 



<21Q> IS 

<211> 990 

<212> DNA 

<213> Homo sapiens 








<40Q> 16 

ggcatgaaga. aggagatgat 


catgtccgac 


gaggccgtgg aggagaggcg ggccttgatc 


60 


aagcggaaga aaagtgaacg 


gacagggact 


cagccactgg gagtgcaggg gctgacagag 


120 


gagcagcgga tgatgatcag 


ggagctgatg 


gacgctcaga . tgaaaacctt tgacactacc 


180 


ttctcccatt tcaagaattt 


ccggctgcca 


ggggtgctta gcagtggctg cgagttgcca 


240 


gagtctctgc aggccccatc 


gagggaagaa 


gctgccaagt ggagccaggt ccggaaagat 


300 


ctgtgctctt . tgaaggtctc 


tctgcagctg 


cggggggagg atggcagtgt ctggaactac 


360 


aaacccccag- ccgacagtgg 


cgggaaagag 


atcttctccc tgctgcccca catggctgac 


420 


atgtcaacct acatgttcaa 


aggcatcatc 


agctttgcca aagtcatctc ctacttcagg 


480 


gacttgccca. tcgaggacca 


gatctccctg 


ctgaaggggg ccgctttcga gctgtgtcaa 


540 


ctgagattca acacagtgtt 


caacgcggag 


actggaacct gggagtgtgg ccggctgtcc 


600 


tactgcttgg aagacactgc 


aggtggcttc 


cagcaacttc tactggagcc catgctgaaa / 


660 


ttccactaca tgctgaagaa 


gctgcagctg 


catgaggagg agtatgtgct gatgcaggcc 


720 


atctccctct tctccccaga 


ccgcccaggt 


gtgctgcagc accgcgtggt- ggaccagctg 


780 


caggagcaat tcgccattac 


tctgaagtcc 


tacattgaat gcaatcggcc ccagcctgct 


840 


cataggttct tgttcctgaa 


gatcatggct 


atgctcaccg agctccgcag catcaatgct 


900 


cagcacaccc agcggctgct 


gcgcatccag 


gacatacacc cctttgctac gcccctcatg 


960 


caggagttgt tcggcatcac 


aggtagctga 




990 


<210> 17 

<211> 990 

<212> DNA 

<213> Homo sapiens 








<400> 17 

tcagctacct gtgatgccga 


acaactcctg 


catgaggggc gtagcaaagg ggtgtatgtc 


60 


ctggatgcgc agcagccgct 


gggtgtgctg 


agcattgatg ctgcggagct cggtgagcat 


120 
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agccatgatc 


ttcaggaaca 


agaacctatg 


agcaggctgg 


ggccgattgc 


attcaatgta 


180 


ggacttcaga 


gtaatggcga 


attgctcctg 


cagctggtcc 


accacgcggt. 


gctgcagcac 


240 


acctgggcgg 


tctggggaga 


agagggagat 


ggcctgcatc 


agcacatact 


cctcctcatg 


300 


cagctgcagc 


ttcttcagca 


tgtagtggaa 


tttcagcatg 


ggctccagta 


gaagttgctg 


360 


gaagccacct 


gcagtgtctt 


ccaagcagta 


ggacagccgg 


ccacactccc 


aggttccagt 


420 


ctccgcgttg 


aacactgtgt 


tgaatctcag 


ttgacacagc 


tcgaaagcgg 


cccccttcag 


480 


cagggagatc 


tggtcctcga 


tgggcaagtc 


cctgaagtag 


gagatgactt 


tggcaaagct 


540 


gatgatgcct 


ttgaacatgt 


aggttgacat 


gtcagccatg 


tggggcagca 


gggagaagat 


600 


ctctttcccg 


ccactgtcgg 


ctgggggttt 


gtagttccag 


acactgccat 


cctccccccg 


660 


cagctgcaga 


gagaccttca 


aagagcacag 


atctttccgg. 


acctggctcc 


acttggcagc 


720 


ttcttccctc 


gatggggcct 


gcagagactc 


tggcaactcg 


cagccactgc 


taagcacccc 


780 


tggcagccgg 


aaattcttga 


aatgggagaa 


ggtagtgtca 


aaggttttca 


tctgagcgtc 


840 


catcagctcc. 


ctgatcatca 


tccgctgctc 


ctctgtcagc 


ccctgcactc 


ccagtggctg 


900 


agtccctgtc 


cgttcacttt 


tcttccgctt 


gatcaaggcc 


cgcctctcct 


ccacggcctc 


960 


gtcggacatg 


atcatctcct 


tcttcatgcc 








990 



<210> 18 

. <211> 329 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Gly Met Lys Lys Glu Met lie Met Ser Asp Glu Ala Val Glu Glu Arg 
1 5 10 15 

Arg Ala Leu He Lys Arg Lys Lys Ser Glu Arg Thr Gly Thr Gin Pro 
20 25 30 

Leu Gly Val Gin Gly Leu Thr Glu Glu Gin Arg Met Met lie Arg Glu 
35 40 45 

Leu .Met Asp Ala Gin Met Lys Thr Phe Asp Thr Thr Phe Ser His Phe 
50 55 60 

Lys Asn Phe Arg Leu Pro Gly Val Leu Ser Ser Gly Cys Glu Leu Pro 
65 70 75 80 

Glu Ser Leu Gin Ala Pro Ser Arg Glu Glu Ala Ala Lys Trp Ser Gin 
85 90 95 

Val Arg Lys Asp Leu Cys Ser Leu Lys Val Ser Leu Gin Leu Arg Gly 
100 105 110 

Glu Asp Gly Ser Val Trp Asn Tyr Lys Pro Pro Ala Asp Ser Gly " Gly 
115 120 125 
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Lys Glu lie Phe Ser Leu Leu Pro His Met Ala Asp Met ■ Ser Thr Tyr 
130 135 140 

Met Phe Lys Gly lie lie Ser Phe Ala Lys Val lie Ser Tyr Phe Arg 
145 150 155 160 

Asp Leu Pro lie Glu Asp Gin lie Ser Leu Leu Lys Gly Ala Ala Phe 
165 170 175 

Glu Leu Cys Gin Leu Arg Phe Asn Thr Val Phe Asn Ala Glu Thr Gly 
180 185 190 

Thr Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp Thr Ala Gly 
195 ; 200 205 

Gly Phe Gin Gin Leu Leu Leu Glu Pro Met Leu Lys Phe His Tyr Met 
210 215 220 

Leu Lys Lys Leu Gin Leu His Glu Glu Glu Tyr Val Leu Met Gin Ala 
225 ' 230 235 240 

lie Ser Leu Phe Ser Pro Asp Arg Pro Gly Val Leu Gin His Arg Val 
245 250 255 

Val Asp Gin Leu Gin Glu Gin Phe Ala lie Thr Leu Lys Ser Tyr lie 
260 265 270 

Glu Cys Asn Arg Pro Gin Pro Ala His Arg Phe Leu Phe Leu Lys lie 
275 280 285 

Met Ala. Met Leu Thr Glu Leu Arg Ser He Asn Ala Gin His Thr Gin 
290^ 295 300 

Arg Leu Leu Arg He Gin Asp He His Pro Phe Ala Thr Pro Leu Met 
305 310 315 320 

Gin Glu Leu Phe Gly He Thr Gly Ser 
325 

<210> 19 

<211> 1389 

<212> DNA 

<213> Homo sapiens 



<400> 19 
atggacacca 


aacatttcct 


gccgctcgat 


ttctccaccc 


aggtgaactc 


ctccctcacc 


60 


tccccgacgg 


ggcgaggctc 


catggctgcc 


ccctcgctgc 


acccgtccct 


ggggcctggc 


120 


atcggctccc 


cgggacagct 


gcattctccc 


atcagcaccc 


tgagctcccc 


catcaacggc 


180 


atgggcccgc 


ctttctcggt 


catcagctcc 


cccatgggcc 


cccactccat 


gtcggtgccc 


240 


accacaccca 


ccctgggctt 


cagcactggc 


agcccccagc 


tcagctcacc 


tatgaacccc 


300 


gtcagcagca 


gcgaggacat 


caagcccccc 


ctgggcctca 


atggcgtcct 


caaggtcccc 


360 


gcccacccct 


caggaaacat 


ggcttccttc 


accaagcaca 


tctgcgccat 


ctgcggggac 


420 


cgctcctcag 


gcaagcacta 


tggagtgtac 


agctgcgagg 


ggtgcaaggg 


cttcttcaag 


480 


cggacggtgc 


gcaaggacct 


gacctacacc 


tgccgcgaca 


acaaggactg 


cctgattgac 


540 
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aagcggcagc 


ggaaccggtg 


ccagtactgc 


cgctaccaga agtgcctggc catgggcatg 


600 


aagcgggaag 


ccgtgcagga 


ggagcggcag 


cgtggcaagg accggaacga gaatgaggtg 


660 


gagtcgacca 


gcagcgccaa 


cgaggacatg 


ccggtggaga ggatcctgga ggctgagctg 


720 


gccgtggagc 


ccaagaccga 


gacctacgtg 


gaggcaaaca tggggctgaa ccccagctcg 


780 


ccgaacgacc 


ctgtcaccaa 


catttgccaa 


gcagccgaca aacagctttt caccctggtg 


840 


gagtgggcca 


agcggatccc 


acacttctca 


gagctgcccc tggacgacca ggtcatcctg 


900 


ctgcgggcag 


gctggaatga 


gctgctcatc 


gcctccttct . cccaccgctc catcgccgtg 


960 


aaggacggga 


tcctcctggc 


caccgggctg 


cacgtccacc ggaacagcgc ccacagcgca 


1020 


ggggtgggcg 


ccatctttga 


cagggtgctg 


acggagcttg tgtccaagat gcgggacatg 


1080 


cagatggaca 


agacggagct 


gggctgcctg 


cgcgccatcg tcctctttaa ccctgactcc 


1140 


aaggggctct 


cgaacccggc 


cgaggtggag 


gcgctgaggg agaaggtcta tgcgtccttg 


1200 


gaggcctact 


.gcaagcacaa 


gtacccagag 


cagccgggaa ggttcgctaa gctcttgctc 


1260 


cgcctgccgg 


ctctgcgctc 


catcgggctc 


aaatgcctgg aacatctctt cttcttcaag 


.1320 


ctcatcgggg 


acacacccat 


tgacaccttc 


cttatggaga tgctggaggc gccgcaccaa 


1380 


atgacttag 








1389 



<210> 20 

<211> 1389 

<212> DNA 

<213> Homo sapiens 

<400> 20 



ctaagtcatt 


tggtgcggcg 


cctccagcat 


ctccataagg aaggtgtcaa tgggtgtgtc 


60 


cccgatgagc 


ttgaagaaga 


agagatgttc 


caggcatttg agcccgatgg agcgcagagc 


120 


cggcaggcgg 


agcaagagct 


tagcgaacct 


tcccggctgc tctgggtact tgtgcttgca 


180 


gtaggcctcc 


aaggacgcat 


agaccttctc 


cctcagcgcc tccacctcgg ccgggttcga 


240 


gagccccttg 


gagtcagggt 


taaagaggac 


gatggcgcgc aggcagccca gctccgtctt 


300 


gtccatctgc 


atgtcecgca 


tcttggacac 


aagctccgtc agcaccctgt caaagatggc 


360 


gcccacccct 


gcgctgtggg 


cgctgttccg 


gtggacgtgc agcccggtgg ccaggaggat 


420 


cccgtccttc 


acggcgatgg 


agcggtggga 


gaaggaggcg atgagcagct cattccagcc 


480 


tgcccgcagc 


aggatgacct 


ggtcgtccag 


gggcagctct gagaagtgtg ggatccgctt 


540 


ggcccactcc 


accagggtga 


aaagctgttt 


gtcggctgct tggcaaatgt tggtgacagg 


600 


gtcgttcggc 


gagctggggt 


tcagccccat 


gtttgcctcc acgtaggtct cggtcttggg 


660 


ctccacggcc 


agctcagcct 


ccaggatcct 


ctccaccggc atgtcctcgt tggcgctgct 


720 


ggtcgactcc 


acctcattct 


cgttccggtc 


cttgccacgc tgccgctcct cctgcacggc 


780 
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ttcccgcttc 
ctgccgcttg 
caccgtccgc 
tgaggagcgg 

ggggtgggcg 

gctgctgacg 

gggtgtggtg 

cgggcccatg 
ggagccgatg 
cgtcggggag 
ggtgtccat , 



atgcccatgg 
tcaatcaggc 
ttgaagaagc 
tccccgcaga 
gggaccttga 
gggttcatag 
ggcaccgaca 
ccgttgatgg 
ccaggcccca 
gtgagggagg 
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15 

ccaggcactt ctggtagcgg cagtactggc accggttccg 840 

agtccttgtt gtcgcggcag gtgtaggtca ggtccttgcg 900 

ccttgcaccc ctcgcagctg tacactccat agtgcttgcc 960 

tggcgcagat gtgcttggtg aaggaagcca tgtttcctga 1020 

ggacgccatt gaggcccagg gggggcttga tgtcctcgct 1080 

gtgagctgag ctgggggctg ccagtgctga agcccagggt 1140 

tggagtgggg gcccatgggg gage t gat ga ccgagaaagg 1200 

gggagctcag ggtgctgatg ggagaatgea <gctgtcccgg 1260 

gggacgggtg cagegagggg -geagecatgg agcctcgccc .1320 

agttcacctg ggtggagaaa tegageggea ggaaatgttt 1380 

1389 . 



<210> 21 

<211> 462 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Met Asp Thr Lys His Phe Leu Pro Leu Asp Phe Ser Thr Gin Val Asn 
15 10 15 

Ser Ser Leu Thr Ser Pro Thr Gly Arg Gly Ser Met Ala Ala Pro Ser 
20 25 30 

Leu His Pro Ser Leu Gly Pro Gly lie Gly Ser Pro Gly Gin Leu His 
35 40 45 

Ser Pro lie Ser Thr Leu Ser Ser Pro lie Asn Gly Met Gly Pro Pro 
50 55 60 

Phe Ser Val lie Ser Ser Pro Met Gly Pro His Ser Met Ser Val Pro 
65 70 75 80 

Thr Thr Pro Thr Leu Gly Phe Ser Thr Gly Ser Pro Gin Leu Ser Ser 
85 90 95 

Pro Met Asn Pro Val Ser Ser Ser Glu Asp lie Lys Pro Pro Leu Gly 
100 105 110 

Leu Asn Gly Val Leu Lys Val Pro Ala His Pro Ser Gly Asn Met Ala 
115 120 125 

Ser Phe Thr Lys His lie Cys Ala lie Cys Gly Asp Arg Ser Ser Gly 
130 135 140 

Lys His Tyr Gly Val Tyr Ser Cys Glu Gly Cys Lys Gly Phe Phe Lys 
145 150 155 160 

Arg Thr Val Arg Lys Asp Leu Thr Tyr Thr Cys Arg Asp Asn Lys Asp 
165 170 175 
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Cys Leu lie Asp Lys Arg Gin Arg Asn Arg Cys Gin Tyr Cys Arg Tyr 
180 185 190 

Gin Lys Cys Leu Ala Met Gly Met Lys Arg Glu Ala Val Gin Glu Glu 
195 200 205 

Arg Gin Arg Gly Lys Asp Arg Asn Glu Asn Glu Val Glu .Ser Thr Ser 
210 * 215 220 

Ser Ala Asn Glu Asp Met Pro Val Glu Arg He Leu Glu Ala Glu Leu 
225 230 235 240 

Ala Val Glu Pro Lys Thr Glu Thr Tyr Val Glu Ala Asn Met Gly Leu 
245 250 255 

Asn Pro Ser Ser Pro Asn Asp Pro Val Thr Asn lie Cys Gin Ala . Ala 
260 265 270 

Asp Lys Gin Leu Phe Thr Leu Val Glu Trp Ala Lys Arg lie Pro His 
275 280 285 

Phe Ser Glu Leu Pro Leu Asp Asp Gin Val He Leu Leu Arg Ala Gly 
290 295 300 

Trp, Asn Glu Leu Leu He Ala Ser Phe Ser His Arg Ser lie Ala Val 
305 310 315 320 . 

Lys Asp Gly He Leu Leu Ala Thr Gly Leu His Val His Arg Asn Ser 
325 330 335 

Ala His Ser Ala Gly Val Gly Ala He Phe Asp Arg Val Leu Thr Glu 
340 345 350 

Leu Val Ser Lys Met Arg Asp Met Gin Met Asp Lys Thr Glu Leu Gly 
355 360 365 ' 

Cys Leu Arg Ala He Val Leu Phe Asn Pro Asp Ser Lys Gly Leu Ser 
370 375 380 

Asn Pro Ala Glu Val Glu Ala Leu Arg Glu Lys Val Tyr Ala Ser Leu 
385 390 395 400 

Glu Ala Tyr Cys ^Y 8 His Lys Tyr Pro Glu Gin Pro Gly Arg Phe Ala 

405 410 415 

r 

Lys Leu Leu Leu Arg Leu Pro Ala Leu Arg Ser lie Gly Leu Lys Cys 
420 425 430 

Leu Glu His Leu Phe Phe Phe Lys Leu He Gly Asp Thr Pro lie Asp 
435 440 445 

Thr Phe Leu Met Glu Met Leu Glu Ala Pro His Gin Met Thr 
450 455 460 

<210> 22 

<211> 1602 

<212> DKA 

<213> Homo sapiens 

<400> 22 

atgtcttggg ccgctcgccc gcccttcctc cctcagcggc atgccgcagg gcagtgtggg 60 
ccggtggggg tgcgaaaaga aatgcattgt ggggtcgcgt cccggtggcg gcggcgacgg 120 
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ccctggctgg atcccgcagc 


ggcggcggcg 


gcggcggtgg 


caggcggaga acaacaaacc 


180 


ccggagccgg agccagggga 


ggctggacgg 


gacgggatgg 


gcgacagcgg gcgggactcc 


240 


cgaagcccag acagctcctc 


cccaaatccc 


cttccccagg 


gagtccctcc cccttctcct 


300 


cctgggccac ccctaccccc 


ttcaacagct 


cctacccttg 


gaggctctgg ggccccaccc 


360 


ccacccccga tgccaccacc 


cccactgggc 


tctccctttc 


cagtcatcag ttcttccatg 


420 


gggtcccctg gtctgccccc 


tccagctccc 


ccaggattct 


ccgggcctgt cagcagcccc 


480 


cagattaact caacagtgtc 


actccctggg 


ggtgggtctg 


gcccccctga agatgtgaag 


540 


:ccaccagtct taggggtccg 


gggcctgcac 


tgtccacccc 


ctccaggtgg ccctggggct 


600 


ggcaaacggc tatgtgcaat 


ctgcggggac 


agaagctcag 


gcaaacacta cggggtttac 


660 


agctgtgagg gttgcaaggg 


cttcttcaaa 


cgcaccatcc 


gcaaagacct tacatactct 


720 


tgccgggaca acaaagactg 


cacagtggac 


aagcgccagc 


ggaaccgctg tcagtactgc 


780 


cgctatcaga agtgcctggc 


cactggcatg 


aagagggagg 


cggtacagga ggagcgtcag 


r 840 


cggggaaagg acaaggatgg 


ggatggggag 


ggggctgggg 


gagcccccga ggagatgcct 


900 


gtggacagga tcctggagge 


agagcttgct 


gtggaacaga 


agagtgacca gggcgttgag 


960 


ggtcctgggg gaaccggggg 


tagcggcagc 


agcccaaatg 


accctgtgac taacatctgt 


1020 


caggcagctg acaaacagct 


attcacgctt 


gttgagtggg 


cgaagaggat ^cccacacttt 


; 1080 


tcctccttgc ctctggatga 


tcaggtcata 


ttgctgcggg 


caggctggaa tgaactcctc 


4 1140 


attgcctcct tttcacaccg 


atccattgat 


gttcgagatg 


gcatcctcct tgccacaggt 


1200 


cttcacgtgc accgcaactc 


agcccattca 


gcaggagtag 


gagccatctt tgatcgggtg 


1260 


ctgacagagc tagtgtccaa 


aatgcgtgac 


atgaggatgg 


acaagacaga gcttggctgc 


1320 


ctgagggcaa tcattctgtt 


taatccagat 


gccaagggcc 


tctccaaccc tagtgaggtg 


1380 


gaggtcctgc gggagaaagt 


gtatgcatca 


ctggagacct 


actgcaaaca gaagtaccct 


1440 


gagcagcagg gacggtttgc 


caagctgctg 


ctacgtcttc 


ctgccctccg gtccattggc 


1500 


cttaagtgtc tagagcatct 


gtttttcttc 


aagctcattg 


gtgacacccc catcgacacc 


1560 


ttcctcatgg agatgcttga 


ggctccccat 


caactggcct 


ga 


1602 



<210> 23 

<211> 1602 

<212> DNA 

<213> Homo sapiens 

<400> 23 

tcaggccagt tgatggggag cctcaagcat ctccatgagg aaggtgtcga tgggggtgtc 60 

accaatgagc ttgaagaaaa acagatgctc tagacactta aggccaatgg accggagggc 120 

aggaagacgt agcagcagct tggcaaaccg tccctgctgc tcagggtact tctgtttgca 180 



WO 02/18420 PCT/EP01/09488 

: is 



gtaggtctcc 


agtgatgcat 


acactttctc ccgcaggacc 


tccacctcac 


tagggttgga 


240 


gaggcccttg 


gcatctggat 


.taaacagaat gattgccctc 


aggcagccaa 


gctctgtctt * 


300 


gtccatcctc 


atgtcacgca 


ttttggacac tagctctgtc 


agcacccgat 


caaagatggc 


360 


tcctactcct 


gctgaatggg 


ctgagttgcg gtgcacgtga 


agacctgtgg . 


caaggaggat 


420 


gccatctcga 


acatcaatgg 


atcggtgtga aaaggaggca 


atgaggagtt 


cattccagcc 


480 


tgcccgcagc 


aatatgacct 


gatcatccag aggcaaggag 


gaaaagtgtg 


ggatcctctt 


540 


cgcccactca 


acaagcgtga 


atagctgttt gtcagctgcc 


tgacagatgt 


tagtcacagg 


600 


gtcatttggg 


ctcfctgccgc 


tacccccggt tcccccagga 


ccctcaacgc 


cctggtcact 


660 


cttctgttcc 


acagcaagct 


ctgcctccag gatcctgtcc 


acaggcatct 


cctcgggggc 


720 


tcccccagcc 


ccctccccat 


ccccatcctt gtcctttccc 


cgctgacgct 


cctcctgtac 


780 


cgcctccctc 


ttcatgccag 


tggccaggca cttctgatag 


cggcagtact 


gacagcggtt 


840 


ccgctggcgc 


ttgtccactg 


tgcagtcttt gttgtcccgg 


caagagtatg 


taaggtcttt 


900 


gcggatggtg 


cgtttgaaga 


agcccttgca accctcacag 


i 

ctgtaaaccc 


cgtagtgttt 


960 


gcctgagctt 


ctgtccccgc 


agattgcaca tagccgtttg 


ccagccccag 


ggccacctgg 


1020 


agggggtgga 


cagtgcaggc 


cccggacccc taagactggt 


ggcttcacat cttcaggggg 


1080 


gccagaccca 


cccccaggga 


gtgacactgt tgagttaatc 


tgggggctgc 


tgacaggccc 


1140 


ggagaatcct 


gggggagctg 


gagggggcag accaggggac 


cccatggaag 


aactgatgac 


1200 


tggaaaggga 


gagcccagtg 


ggggtggtgg catcgggggt 


gggggtgggg 


ccccagagcc 


1260 


tccaagggta 


ggagctgttg 


aagggggtag gggtggccca ggaggagaag ggggagggac 


1320 


tccctgggga 


aggggatttg 


gggaggagct gtctgggctt 


cgggagtccc 


gcccgctgtc 


1380 


gcccatcccg 


tcccgtccag 


cctcccctgg ctccggctcc ggggtttgtt gttctccgcc 


1440 


tgccaccgcc 


gccgrccgccg 


ccgctgcggg atccagccag ggccgtcgcc gccgccaccg 


1500 


ggacgcgacc 


ccacaatgca 


tttcttttcg cacccccacc 


ggcccacact 


gccctgcggc 


1560 


atgccgctga 


gggaggaagg 


gcgggcgagc ggcccaagac 


at 




1602 



<210> 24 

<211> 533 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Ser Trp Ala Ala Arg Pro Pro Phe Leu Pro Gin Arg His Ala Ala 
1 5 10 15 

Gly Gin Cys Gly Pro Val Gly Val Arg Lys Glu Met His Cys Gly Val 
20 25 30 
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Ala Ser Arg Trp Arg Arg Arg Arg Pro Trp Leu Asp Pro Ala Ala Ala 
35 40 45 

Ala Ala Ala Ala Val Ala Gly Gly Glu Gin Gin Thr Pro Glu Pro Glu 
50 55 60 

Pro Gly Glu Ala Gly Arg Asp Gly Met Gly Asp Ser Gly Arg Asp Ser 
65 70 75 80 

Arg Ser Pro Asp Ser Ser Ser Pro Asn Pro Leu Pro Gin Gly Val Pro 
85 90 95 

Pro Pro Ser Pro Pro Gly Pro Pro Leu Pro Pro Ser Thr Ala Pro Thr 
100 105 110 

Leu Gly Gly . Ser Gly Ala Pro Pro Pro Pro Pro Met Pro Pro Pro Pro 
115 120 125 

Leu Gly Ser Pro Phe Pro Val lie Ser Ser Ser Met Gly Ser Pro Gly 
130 135 140 

Leu Pro Pro Pro Ala Pro Pro Gly Phe Ser Gly Pro Val Ser Ser Pro 
145 150 155 160 

Gin lie Asn Ser Thr Val Ser Leu Pro Gly Gly Gly Ser Gly Pro Pro 
165 170 175 

Glu Asp Val Lys Pro Pro Val Leu Gly Val Arg Gly Leu His Cys Pro 
180 185 190 

Pro Pro Pro Gly Gly Pro Gly Ala Gly Lys Arg Leu Cys Ala lie Cys 
195 200 205 

Gly "Asp Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys Glu Gly 
210 215 220 

Cys Lys Gly Phe Phe Lys Arg Thr lie Arg Lys Asp Leu Thr Tyr Ser 
225 230 235 240 

Cys Arg Asp Asn Lys Asp Cys Thr Val Asp Lys Arg Gin Arg Asn Arg 
245 250 255 

Cys Gin Tyr Cys Arg Tyr Gin Lys Cys Leu Ala Thr Gly Met Lys Arg 
260 265 270 

Glu Ala Val Gin Glu Glu Arg Gin Arg Gly Lys Asp Lys Asp Gly Asp 
275 280 285 

Gly Glu Gly Ala Gly Gly Ala Pro Glu Glu Met Pro Val Asp Arg lie 
290 295 300 

Leu "Glu Ala Glu Leu Ala Val Glu Gin Lys Ser Asp Gin Gly Val Glu 
305 310 315 320 

Gly Pro Gly Gly Thr Gly Gly Ser Gly Ser Ser Pro Asn Asp Pro Val 
325 330 335 

Thr Asn lie Cys Gin Ala Ala Asp Lys Gin Leu Phe Thr Leu Val Glu 
340 345 350 

Trp Ala Lys Arg lie Pro His Phe Ser Ser Leu Pro Leu Asp Asp Gin 
355 360 365 
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Val lie Leu Leu Arg Ala Gly Trp Asn Glu Leu Leu lie Ala Ser Phe 
370 375 380 

Ser His Arg Ser He Asp Val Arg Asp Gly He Leu Leu Ala Thr Gly 
385 390 395 400 

Leu His Val His Arg Asn Ser Ala His Ser Ala Gly Val Gly Ala lie 
405 410 415 

Phe Asp Arg Val Leu Thr Glu Leu Val Ser Lys Met Arg Asp Met Arg 
420 425 430 

Met Asp Lys Thr Glu Leu Gly Cys Leu Arg Ala He He Leu Phe Asn 
435 440 445 

Pro Asp Ala Lys Gly Leu Ser Asn Pro Ser Glu Val Glu Val Leu Arg 
450 455 460 

Glu Lys Val Tyr Ala Ser Leu Glu Thr Tyr Cys Lys Gin Lys Tyr Pro 
465 470 475 480 

Glu Gin Gin Gly Arg Phe Ala Lys Leu Leu Leu Arg Leu Pro Ala Leu 
485 490 495 

Arg Ser He Gly Leu Lys Cys Leu Glu His Leu Phe Phe Phe Lys Leu 
500 505 510 

He Gly Asp Thr Pro He Asp Thr Phe Leu Met Glu Met Leu Glu Ala 
515 520 525 

Pro His Gin Leu Ala 
530 

<210>. 25 

<211> 1392 

<212> DNA 

<213> Homo sapiens 

<400> 25 

atgtatggaa attattctca cttcatgaag tttcccgcag gctatggagg ctcccctggc 60 

cacactggct ctacatccat gagcccatca gcagccttgt ccacagggaa gccaatggac 120 

agccacccca gctacacaga taccccagtg agtgccccac ggactctgag tgcagtgggg 160 * 

acccccctca atgccctggg ctctccatat cgagtcatca cctctgccat gggcccaccc 240 

tcaggagcac ttgcagcgcc tccaggaatc aacttggttg ccccacccag ctctcagcta 300 

aatgtggtca acagtgtcag cagttcagag gacatcaagc ccttaccagg gcttcccggg 360 # 

attggaaaca tgaactaccc atccaccagc cccggatctc tggttaaaca catctgtgct 420 

atctgtggag acagatcctc aggaaagcac tacggggtat acagttgtga aggctgcaaa 480 

gggttcttca agaggacgat aaggaaggac ctcatctaca cgtgtcggga taataaagac 540 

tgcctcattg acaagcgtca gcgcaaccgc tgccagtact gtcgctatca gaagtgcctt 600 

gtcatgggca tgaagaggga agctgtgcaa gaagaaagac agaggagccg agagcgagct 660 

gagagtgagg cagaatgtgc taccagtggt catgaagaca tgcctgtgga gaggattcta 720 
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gaagctgaac 


ttgctgttga 


accaaagaca 


tcgacaaatg 


accctgttac 


caacatatgt 


gttgaatggg 


ccaagcgtat 


tccccacttc 


ttgcttcggg 


cagggtggaa 


tgaattgctg 


gtgcaggatg 


gcatccttct 


ggccacgggt 


gctggggtcg 


gctccatctt 


tgacagagtt 


atgcagatgg 


acaagtcgga 


actgggatgc 


gccaagggcc 


tgtccaaccc 


ctctgaggtg 


cttgaggcct 


acaccaagca 


gaagtatccg 


ctgcgcctcc 


cagctctgcg 


ttccattggc 


aagctcatcg 


gggacacccc 


cattgacacc 


cagatcacct 


ga 




<210> 26 

<211> 1392 

<212> DNA 

<213> Homo sapiens 




<400> 26 
tcaggtgatc 


tgcagcgggg 


tctccaacat 


cccgatgagc 


ttgaagaaga 


agaggtgctc 


tgggaggcgc 


agcagcagct 


tggcaaacct 


gtaggcctca 


agggtggcat 


aaaccttctc 


caggcccttg 


gcatctgggt 


taaagagtac 


gtccatctgc 


atgtctttca 


ttttggaaac 


gccgacccca 


gcactgtggg 


cactgctccg 


gccatcctgc 


acggaaactg 


agcggtggga 


tgcccgaagc 


aaaatgacct 


ggtcctccaa 


ggcccattca 


acgagggtga 


aaagctgctt 


gtcatttgtc 


gagttctcca 


tattcatgtc 


aagttcagct 


tctagaatcc 


tctccacagg 


tgcctcactc 


tcagctcgct 


ctcggctcct 


catgcccatg 


acaaggcact 


tctgatagcg 


gtcaatgagg 


cagtctttat 


tatcccgaca 


cttgaagaac 


cctttgcagc 


cttcacaact 
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gaatcctatg 


gtgacatgaa 


tatggagaac 


780 


catgctgctg 


acaagcagct 


tttcaccctc 


840 


tctgacctca 


ccttggagga 


ccaggtcatt 


900 


attgcctctt 


tctcccaccg 


ctcagtttcc 


960 


ttacatgtcc 


accggagcag 


tgcccacagt 


1020 


ctaactgagc 


tggtttccaa 


aatgaaagac 


1080 


ctgcgagcca 


ttgtactctt 


taacccagat 


1140 


gagactctgc 


gagagaaggt 


ttatgccacc 


1200 


gaacagccag 


gcaggtttgc 


caagctgctg 


1260 


ttgaaatgcc 


tggagcacct 


cttcttcttc 


1320 


ttcctcatgg 


agatgttgga 


gaccccgctg 


1380 



1392 



ctccatgagg 


aaggtgtcaa 


-tgggggtgtc 


60 


caggcatttc 


aagccaatgg 


aacgcagagc 


120 


gcctggctgt 


tccggatact 


tctgcttggt 


180 


tcgcagagtc 


tccacctcag 


aggggttgga 


240 


aatggctcgc 


aggcatccca 


gttccgactt 


300 


cagctcagtt 


agaactctgt 


caaagatgga 


360 


gtggacatgt 


aaacccgtgg 


ccagaaggat 


420 


gaaagaggca 


atcagcaatt 


cattccaccc 


480 


ggtgaggtca 


gagaagtggg 


gaatacgctt 


540 


gtcagcagca 


tgacatatgt 


tggtaacagg 


600 


accataggat 


tctgtctttg 


gttcaacagc 


660 


catgtcttca 


tgaccactgg 


tagcacattc 


720 


ctgtctttct 


tcttgcacag 


cttccctctt 


780 


acagtactgg 


cagcggttgc 


gctgacgctt 


840 


cgtgtagatg 


aggtccttcc 


ttatcgtcct 


900 


gtataccccg 


tagtgctttc 


ctgaggatct 


960 
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gtctccacag atagcacaga tgtgtttaac cagagatccg gggctggtgg atgggtagtt 1020 

catgtttcca atcccgggaa gccctggtaa gggcttgatg tcctctgaac tgctgacact 1080 

gttgaccaca tttagctgag agctgggtgg ggcaaccaag ttgattcctg gaggcgctgc 1140 

aagtgctcct gagggtgggc ccatggcaga ggtgatgact cgatatggag agcccagggc 1200 

attgaggggg gtccccactg cactcagagt ccgtggggca ctcactgggg tatctgtgta 1260 

gctggggtgg ctgtccattg gcttccctgt ggacaaggct gctgatgggc tcatggatgt 1320 

agagccagtg tggccagggg agcctccata gcctgcggga aacttcatga agtgagaata 1380 

atttccatac at 1392 * 



<210> 27 

<211> 463 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Met Tyr Gly Asn Tyr Ser His Phe Met Lys Phe Pro Ala .Gly Tyr Gly 
1 5 10 15 

Gly Ser Pro Gly His Thr Gly Ser Thr Ser Met Ser Pro Ser Ala Ala 
20 25 30 

Leu Ser Thr Gly Lys Pro Met Asp Ser His Pro Ser Tyr Thr Asp Thr 
35 40 45 

Pro Val Ser Ala Pro Arg Thr Leu Ser Ala Val Gly Thr Pro Leu Asn 
50 55 60 

Ala Leu Gly Ser Pro Tyr Arg Val lie Thr Ser Ala Met Gly Pro Pro 
65 70 75 80 

Ser Gly Ala * Leu Ala Ala Pro Pro Gly lie Asn Leu Val Ala Pro Pro 
85 * 90 95 

Ser Ser Gin Leu Asn Val Val Asn Ser Val Ser Ser Ser Glu Asp lie 
; 100 . 105 110 

Lys Pro Leu Pro Gly Leu Pro Gly lie Gly Asn Met Asn Tyr Pro Ser 
115 120 125 

Thr Ser Pro Gly Ser Leu Val Lys His lie Cys Ala He Cys Gly Asp 
130 135 140 

Arg Ser Ser Gly Lys His Tyr Gly Val Tyr Ser Cys Glu Gly Cys Lys 
145 150 155 160 

Gly Phe Phe Lys Arg Thr He Arg Lys Asp Leu He Tyr Thr Cys Arg 
165 170 175 

Asp Asn Lys Asp Cys Leu He Asp Lys Arg Gin Arg Asn Arg Cys Gin 
180 185 190 

Tyr Cys Arg Tyr Gin Lys Cys Leu Val Met Gly Met Lys Arg Glu Ala 
195 200 205 
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Val Gin Glu Glu Arg Gin Arg Ser Arg Glu Arg Ala Glu Ser Glu Ala 
210 215 220 

Glu Cys Ala Thr Ser Gly His Glu Asp Met Pro Val Glu Arg He Leu 
225 230 235 240 

Glu Ala Glu Leu Ala Val Glu Pro Lys Thr Glu Ser Tyr Gly Asp Met 
245 250 255 

Asn Met Glu Asn Ser Thr Asn Asp Pro Val Thr Asn He Cys His Ala 
260 265 270 

Ala Asp Lys Gin Leu Phe Thr Leu Val Glu Trp Ala Lys \Arg lie Pro 
275 280 285 

His Phe Ser Asp Leu Thr Leu Glu Asp Gin Val lie Leu Leu Arg Ala 
290 295 300 

Gly Trp Asn Glu Leu Leu He Ala Ser Phe Ser His Arg Ser Val Ser 
305 310 315 320 

Val Gin Asp Gly lie Leu Leu Ala Thr Gly Leu His Val His Arg Ser ' 
325 330 335 

Ser Ala His Ser Ala Gly Val Gly Ser lie Phe Asp Arg. Val Leu Thr 
340 345 350 

Glu Leu Val Ser Lys Met Lys Asp Met Gin Met Asp Lys Ser Glu Leu 
355 360 365 

Gly Cys Leu Arg Ala lie Val Leu Phe Asn Pro Asp Ala Lys Gly Leu 
370 375 380 

Ser Asn Pro Ser Glu Val Glu Thr Leu Arg Glu Lys Val Tyr Ala Thr 
385 390 395 400 

Leu Glu Ala Tyr Thr Lys Gin Lys Tyr Pro Glu Gin Pro Gly Arg Phe 
405 410 415 

Ala Lys Leu Leu Leu Arg Leu Pro Ala Leu Arg Ser He Gly Leu Lys 
420 425 430 

Cys Leu Glu His Leu Phe Phe Phe Lys Leu lie Gly Asp Thr Pro He 
.435 440 445 

Asp Thr Phe Leu Met Glu Met Leu Glu Thr Pro Leu Gin lie Thr 
450 455 460 

<210> 28 

<211> 1027 

<212> DNA 

<213> Homo sapiens 

<400> 28 

gactcccaag atggcggacc tactgggctc catcctgagc tccatggaga agccacccag 60 

cctcggtgac caggagactc ggcgcaaggc ccgagaacag gccgcccgcc tgaagaaact 120 

acaagagcaa gagaaacaac agaaagtgga gtttcgtaaa aggatggaga aggaggtgtc 180 

agatttcatt caagacagtg ggcagatcaa gaaaaagttt cagccaatga acaagatcga 240 

gaggagcata ctacatgatg tggtggaagt ggctggcctg acatccttct cctttqggga 300 
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agatgatgac 


tgtcgctatg 


tcatgatctt 


caaaaaggag 


tttgcaccct 


cagatgaaga 


360 


gctagactct* taccgtcgtg 


gagaggaatg 


ggacccccag 


aaggctgagg 


agaagcggaa 


420 


gctgaaggag 


ctggcccaga 


ggcaagagga 


ggaggcagcc 


cagcaggggc 


ctgtggtggt 


480 


gagccctgcc 


agcgactaca 


aggacaagta 


cagccacctc 


atcggcaagg gagcagccaa 


540 


agacgcagcc 


cacatgctac 


aggccaataa 


gacctacggc tgtgtgcccg tggccaataa 


600 


gagggacaca 


cgctccattg 


aagaggctat 


gaatgagatc 


agagccaaga agcgtctgcg 


660 


gcagagtggg 


gaagagttgc 


cgccaacctc 


taggcgcccc gcccagctcc 


ctttgacccc 


720 


tggggcaggg 


cagggggcag 


ggagagacaa 


crcrc t cr c tcrc t 


attagagccc atcctggagc 


780 


cccacctctg 


aaccacctcc 


taccagctgt 


ccet* cacrcicfc 

v \* ^ ^4 


gggggaaaac 


acrcr t crt 1 1 era 


840 


tttgtcaccg 


ttggagcttg 


gatatgtgcg 


tggcatgtgt 


crtcrfccrfccrtcrt 

3 _j J ZJ 


craQacr t crt era 


900 


atgcacaggt 


gggtatttaa 


tctgtattat 


tccccgttct 


tggaattttc 


ttccccatgg 


960 


ggctggggta 


ctttacattc 


aataaatact 


gtttaaccca 


aaaaaaaaaa * 


aaaaaaaaaa 


1020 


aaaaaaa 












1027 


<210> 29 

<211> 1027 

<212> DNA 

<213> Homo sapiens 












<400> 29 
tttttttttt 


tttttttttt 


ttttttttgg 


gttaaacagt 


atttattgaa 


tgtaaagtac 


60 


cccagcccca 


tggggaagaa 


aattccaaga 


acggggaata 


atacagatta 


aatacccacc 


120 


tgtgcattca 


cactctcaca 


cacacacaca 


catgccacgc 


acatatecaa 


gctccaacgg 


180 


tgacaaatca 


aacacctgtt 


ttcccccagc 


ctgagggaca 


gctggtagga 


ggtggttcag 


240 


aggtggggct 


ccaggatggg 


ctctaatagc 


agcagccttg 


tctctccctg 


ccccctgccc 


300 


tgccccaggg 


gtcaaaggga 


gctgggcggg 


gcgcctagag 


gttggcggca 


actcttcccc . 


360 


actctgccgc 


agacgcttct 


tggctctgat 


ctcattcata 


gcctcttcaa 


tggagcgtgt 


420 


gtccctctta 


ttggccacgg 


gcacacagcc 


gtaggtctta 


ttggcctgta 


gcatgtgggc 


480 


tgcgtctttg 


gctgctccct 


tgccgatgag 


gtggctgtac 


ttgtccttgt 


agtcgctggc 


540 


agggctcacc 


accacaggcc 


cctgctgggc 


tgcctcctcc 


tcttgcctct 


gggccagctc 


600 


cttcagcttc 


cgcttctcct 


cagccttctg 


ggggtcccat 


tcctctccac 


gaeggtaaga 


660 


gtctagctct 


tcatctgagg 


gtgcaaactc 


ctttttgaag 


atcatgacat 


agegacagtc 


720 


atcatcttcc 


ccaaaggaga 


aggatgtcag 


gccagccact 


tccaccacat 


catgtagtat 


780 


gctcctctcg 


atcttgttca 


ttggctgaaa 


ctttttcttg 


atctgcccac 


tgtcttgaat 


840 


gaaatctgac 


acctccttct 


ccatcctttt 


acgaaactcc 


actttctqtt 


gtttctcttcr 


900 
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ctcttgtagt ttcttcaggc gggcggcctg ttctcgggcc ttgcgccgag tctcctggtc 960 
accgaggctg ggtggcttct ccatggagct caggatggag cccagtaggt ccgccatctt 1020 
gggagtc 1027 



<210> 30 

<211> 293 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Ala Asp Leu Leu Gly Ser lie Leu Ser Ser Met Glu:LysvPro Pro 
1 5 10 15 

Ser Leu Gly Asp Gin Glu Thr Arg Arg Lys Ala Arg Glu Gin Ala Ala 
20 25 30 

Arg Leu Lys Lys Leu Gin Glu Gin Glu Lys Gin Gin Lys Val Glu Phe 
35 40 45' 

Arg Lys Arg Met Glu Lys Glu Val Ser Asp Phe He Gin Asp Ser Gly 
50 55 60 

Gin He Lys Lys Lys Phe Gin Pro Met Asn Lys lie Glu Arg Ser He 
65 70 75 80 

Leu His Asp Val Val Glu Val Ala Gly Leu Tjir Ser Phe Ser Phe Gly 
85 90 95 

Glu Asp Asp Asp Cys Arg Tyr Val Met He Phe Lys Lys Glu Phe Ala 
100 105 110 

Pro Ser Asp Glu Glu Leu Asp Ser Tyr Arg Arg Gly Glu Glu Trp Asp 
115 120 125 

Pro Gin Lys Ala Glu Glu Lys Arg Lys Leu Lys Glu Leu Ala Gin Arg 
130 135 140 

Gin Glu Glu Glu Ala Ala Gin Gin Gly Pro Val Val Val Ser Pro Ala 
145 150 155 160 

Ser Asp Tyr Lys Asp Lys Tyr Ser His Leu He Gly Lys Gly Ala Ala 
165 170 175 

Lys Asp Ala Ala His Met Leu Gin Ala Asn Lys Thr Tyr Gly Cys Val 
180 185 190 

Pro Val Ala Asn Lys Arg Asp Thr Arg Ser He Glu Glu Ala Met Asn 
195 200 ? 205 

Glu He Arg Ala Lys Lys Arg Leu Arg Gin Ser Gly Glu Glu Leu Pro 
210 215 220 

Pro Thr Ser Arg Arg Pro Ala Gin Leu Pro Leu Thr Pro Gly Ala Gly 
225 230 235 240 

Gin Gly Ala Gly Arg Asp Lys Ala Ala Aia He Arg Ala His Pro Gly 
245 250 255 
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Ala Pro Pro Leu Asn His Leu Leu Pro Ala Val Pro Gin Ala Gly Gly 
260 265 270 

Lys Gin Val Phe Asp Leu Ser Pro Leu Glu Leu Gly Tyr Val Arg Gly 
275 280 285 

Met Cys Val Cys Val 
290 
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